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PREFACE 


This  Research  Note  Is  one  of  three  that  present  the  results  of  a 
literature  review  conducted  as  part  of  Project  A,  a  large-scale,  multiyear 
research  program  Intended  to  Improve  the  selection  and  classification 
system  for  Initial  assignment  of  persons  to  U.S.  Army  Military  Occupational 
Specialties.  The  research  is  sponsored  by  the  U.S.  Army  Research  Institute 
for  the  Behavioral  and  Social  Sciences. 

The  three  Research  Notes  each  cover  a  separate  domain  of  measures  of 
human  abilities,  Interests,  and  other  attributes.  Their  titles  are: 

o  Literature  Review:  Cognitive  Abilities  --  Theory,  History, 
and  Validity  by  Jody  L.  Toquam,  VyVy  A.  Corpe, 

Marvin  0.  Ounnette,  and  Margaret  A.  Keyes. 

o  Literature  Review:  Validity  and  Potential  Usefulness  of 
Psychomotor  Ability  Tests  for  Personnel  Selection  and 
Classification  by  Jeffrey  J.  McHenry  and  Sharon  R.  Rose. 

o  Literature  Review:  Utility  of  Temperament,  Biodata,  and 
Interest  Assessment  for  Predicting  Job  Performance  by 
Leaetta  M.  Hough  (ed.). 

The  findings  presented  In  these  documents  were  used  in  the  development 
of  a  battery  of  new  tests  and  Inventories  for  use  In  Project  A.  The  focus 
of  that  development  effort  was  to  Identify  abilities  and  other  human  attri¬ 
butes  that  seemed  "best  bets"  for  predicting  soldiers'  job  performance,  and 
then  to  develop  new  measures  for  those  attributes.  These  Research  Notes, 
however,  have  usefulness  beyond  that  particular  applied  problem.  Many 
Issues  pertinent  to  the  measurement  and  use  of  human  abilities  are  de¬ 
scribed  and  discussed  In  each  of  these  compilations. 

The  Research  Notes  describe  the  results  and  findings  of  the  literature 
review,  but  do  not  describe  the  literature  search  process  Itself.  There¬ 
fore,  we  provide  a  description  of  that  process  here. 

The  literature  search  was  conducted  by  three  research  teams  from  the 
Personnel  Decisions  Research  Institute.  Each  team  was  responsible  for  one 
of  the  three  fairly  broadly  defined  areas  of  human  abilities  or  character¬ 
istics  that  are  reported  In  the  Research  Notes:  cognitive  abilities;  psych¬ 
omotor  abilities;  and  non-cognltlve  characteristics  such  as  vocational 
Interests,  biographical  data,  and  measures  of  temperament.  While  these 
domains  were  convenient  for  purposes  of  organizing  and  conducting  litera¬ 
ture  search  activities,  they  were  not  used  as  (nor  Intended  to  be)  a  final 
taxonomy  of  possible  predictor  measures. 

The  major  part  of  the  literature  search  was  conducted  In  late  1982  and 
early  1983.  Within  each  of  the  three  areas,  the  teams  carried  out  essen¬ 
tially  the  same  steps: 

1.  Compile  an  exhaustive  list  of  reports,  articles,  books,  or 
other  sources  that  were  possibly  relevant  to  Project  A. 
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2.  Review  each  Item  and  determine  Its  relevancy  for  the  pro¬ 
ject's  general  purposes  by  examining  the  title  and  abstract 
(or  other  brief  review). 

3.  Obtain  the  sources  Identified  In  the  second  step  as  being 
relevant. 

4.  For  relevant  materials,  conduct  a  thorough  review  and  trans¬ 
fer  applicable  Information  onto  special  review  forms  devel¬ 
oped  for  the  project. 

In  the  first  step,  several  activities  were  designed  to  Insure  that  the 
list  would  be  as  comprehensive  as  possible.  Several  computerized  searches 
of  relevant  data  bases  were  performed.  Across  all  three  ability  areas, 
more  than  10,000  potential  sources  were  Identified  via  the  computer 
searches.  (Of  course,  many  of  these  sources  were  Identified  as  relevant  In 
more  than  one  area,  and  were  thus  counted  more  than  once.) 

In  addition  to  the  computerized  searches,  reference  lists  were  ob¬ 
tained  from  recognized  experts  In  each  area,  emphasizing  the  most  recent 
research  In  the  field.  Several  annotated  bibliographies  were  obtained  from 
military  research  laboratories.  Finally,  the  last  several  years'  editions 
of  research  journals  that  are  frequently  used  In  each  ability  area  were 
scanned,  as  were  more  general  sources  such  as  textbooks,  handbooks,  and 
appropriate  chapters  In  the  Annual  Review  of  Psychology  (which  reviews  the 
most  recent  research  In  a  number  of  conceptually  distinct  areas  of  psy¬ 
chology). 

The  majority  of  the  Items  Identified  In  the  first  steps  proved  not 
relevant  to  the  applied  purpose--that  Is,  the  Identification  and  develop¬ 
ment  of  promising  measures  for  personnel  selection  In  the  U.S.  Army.  These 
nonrelevant  sources  were  weeded  out  In  Step  2. 

The  relevant  sources  were  obtained  and  reviewed,  and  team  members 
completed  two  forms  for  each  source:  an  Article  Review  form  and  a  Predic¬ 
tor  Review  form  (several  of  the  latter  could  be  prepared  for  each  source). 
These  forms  were  designed  to  capture,  In  a  standard  format,  the  essential 
Information  about  the  reviewed  sources,  which  varied  considerably  In  their 
organization  and  reporting  styles. 

The  Article  Review  form  contained  eight  sections:  citation,  abstract, 
list  of  predictors  (keyed  to  the  Predictor  Review  forms),  description  of 
criterion  measures,  description  of  sample(s),  description  of  methodology, 
other  results,  and  reviewer’s  comments.  The  Predictor  Review  form  con¬ 
tained  seven  sections:  description  of  predictor,  reliability,  norms/ 
descriptive  statistics,  correlations  with  other  predictors,  correlations 
with  criteria,  adverse  impact/dlfferentlal  valldlty/test  fairness,  and 
reviewer’s  recommendations  (about  the  usefulness  of  the  predictor).  Each 
predictor  was  tentatively  classified  Into  an  Initial  working  taxonomy  of 
predictor  constructs. 
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The  Review  forms  and  the  actual  sources  that  had  been  located  were 
used  In  two  primary  ways  for  Project  A  purposes.  First,  three  working 
documents  were  written,  one  for  each  of  the  three  areas.  These  working 
documents  later  evolved  Into  the  three  Research  Notes  named  above.  These 
documents  Identified  and  summarized  the  literature  with  regard  to  Issues 
Important  to  the  research  being  conducted,  the  most  appropriate  organiza¬ 
tion  or  taxonomy  of  the  constructs  in  each  area,  and  the  validities  of  the 
various  measures  for  different  types  of  job  performance  criteria.  Second, 
the  predictors  Identified  In  the  review  were  subjected  to  further,  struc¬ 
tured  scrutiny  In  order  to  select  tests  and  Inventories  for  use  In  later 
activities  of  Project  A. 

As  a  set,  the  three  Research  Notes  should  provide  a  valuable  resource 
for  scientists,  researchers,  and  personnel  practitioners  Interested  In 
measurement  of  Individual  differences  In  humans  for  various  applied  pur¬ 
poses,  but  especially  for  selection  and  classification. 
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SECTION  I 


INTRODUCTION 
Background  and  Approach 

The  psychomotor  domain  Includes  a  broad  class  of  abilities  Involved  In 
coordlnatlve,  manipulative,  repetitive  and/or  precise  body  or  limb  move¬ 
ments  (Imhoff  &  Levine,  1981).  The  abilities  represented  In  the  psycho- 
motor  domain  vary  greatly  In  terms  of  speed,  precision,  and  cognitive- 
perceptual  requirements.  All  of  the  psychomotor  abilities,  however,  in¬ 
volve  motor  movement.  Indeed,  as  the  domain  name  "psychomotor"  suggests, 
motor  movement  Is  a  defining  characteristic  of  these  abilities. 

Given  the  perceptual -cognitive  component  Involved  In  many  psychomotor 
abilities,  It  should  not  be  surprising  to  discover  that  the  distinction 
between  the  cognitive-perceptual  and  psychomotor  domains  Is  not  always  very 
clear.  For  example,  some  of  the  abilities  often  Included  In  the  cognitive 
domain  (e.g.,  reaction  time)  Involve  a  very  minor  movement  component. 
Similarly,  many  psychomotor  abilities  (e.g.,  rate  control,  control  preci¬ 
sion,  and  multilimb  coordination)  are  highly  dependent  on  cognitive- 
perceptual  processes  such  as  attention,  encoding,  comparing,  and  movement 
judgment.  For  purposes  of  this  part  of  the  report,  we  focus  on  only  those 
abilities  that  Involve  a  major  manipulative  or  movement  component. 

This  Is  quite  similar  to  the  distinction  Imhoff  and  Levine  drew  be¬ 
tween  psychomotor  and  perceptual  abilities  In  their  1981  literature  review. 
Nevertheless,  Imhoff  and  Levine  chose  not  to  retain  this  distinction 
throughout  most  of  their  paper.  Instead,  they  combined  the  perceptual  and 
psychomotor  domains  Into  a  single  perceptual -motor  domain.  This  contrasts 
with  the  approach  followed  here. 

The  approach  here  also  differs  slightly  from  Fleishman’s  conceptuali¬ 
zation  of  the  limits  of  the  psychomotor  domain,  which  Included  reaction 
time  and  response  orientation  (l.e.,  choice  reaction  time)  as  two  psycho¬ 
motor  ability  dimensions.  In  our  view,  performance  on  measures  of  reaction 
time  and  response  orientation  Is  determined  almost  entirely  by  cognitive 
processes.  During  a  typical  assessment  of  these  abilities,  a  subject’s 
only  motor  task  Is  to  lift  a  finger  from  a  button.  This  scarcely  ap¬ 
proaches  the  richness  of  psychomotor  performance  (i.e.,  coordinative,  ma¬ 
nipulative,  repetitive,  and/or  precise  body  or  limb  movements)  described 
above. 

Time  sharing  (or  divided  attention)  measures  are  very  frequently 
assessed  using  psychomotor  tasks  (e.g.,  tracking  tasks).  To  date,  many 
researchers  who  have  attempted  to  assess  time  sharing  have  totally  con¬ 
founded  time  sharing  with  the  abilities  underlying  performance  on  the 
Individual  tasks  used  In  the  time  sharing  measure  (e.g.,  Braune  &  Wickens, 
1983;  Damos,  1978;  Owens,  Goodman,  Pollack,  &  Braune,  1983;  cf.  Ackerman, 
Schneider,  &  Wickens,  1982).  Most  psychologists,  though,  would  agree  with 
Camos  (1978),  who  stated  that  the  ability  to  time  share  Is  dependent  on  a 
subject’s  attentlonal -processing  capacity.  Therefore,  time  sharing  ability 
Is  not  included  In  the  section  on  psychomotor  abilities. 
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Overview  of  Report 


The  balance  of  this  part  of  the  report  Is  devoted  to  a  review  of  the 
literature  on  psychomotor  abilities,  with  an  emphasis  on  the  use  of  psycho¬ 
motor  abilities  as  predictors  of  training  and  job  performance. 

First,  the  abilities  represented  in  the  psychomotor  domain  are  identi¬ 
fied  and  defined.  In  this  section,  there  Is  also  a  brief  history  of 
psychomotor  ability  research.  At  the  conclusion  of  this  section,  a  taxon¬ 
omy  of  psychomotor  abilities  is  presented.  This  taxonomy  is  used  in  a 
subsequent  presentation  of  the  validity  evidence  for  psychomotor  abilities. 

The  next  section  contains  a  brief  review  and  summary  of  research  on 
motor  skills  learning  and  acquisition.  The  review  of  motor  skills  learning 
Includes  a  description  of  common  experimental  paradigms,  a  summary  of  major 
research  findings,  and  a  listing  of  some  of  the  parameters  which  are  be¬ 
lieved  to  govern  motor  skills  learning.  There  Is  also  a  discussion  of 
Fleishman’s  views  regarding  the  differences  between  psychomotor  skills  and 
psychomotor  abilities.  The  section  concludes  with  a  presentation  of  sev¬ 
eral  competing  theories  of  the  motor  skills  learning  process. 

The  use  of  psychomotor  tests  In  applied  psychological  research  is 
described  In  the  following  section.  That  section  begins  with  a  summary  of 
the  content  validity  evidence  for  psychomotor  measures.  Next,  there  is  a 
summary  of  the  criterion-related  validity  research  using  the  same  set  of 
criterion  constructs  (i.e.,  educational  and  school  achievement,  training 
performance,  job  proficiency,  and  job  Involvement/withdrawal)  used  In  the 
validity  summary  for  the  cognitive-perceptual  domain.  Separate  validity 
summary  tables  are  presented  for  nine  different  job  types  for  both  military 
and  non-military  jobs  and  subjects.  In  addition,  because  a  large  per¬ 
centage  of  the  military  research  has  involved  pilots  and  pilot  trainees,  a 
separate  summary  of  these  data  is  provided. 

Following  the  presentation  of  the  validity  evidence,  a  section  is 
devoted  to  four  major  issues  in  current  research  on  psychomotor  abilities: 
stability  of  psychomotor  measures,  utility,  group  differences,  and  the  use 
of  single-  vs.  multiple-construct  predictor  measures.  The  discussion  of 
stability  focuses  primarily  on  the  reliability  of  the  rank  ordering  of 
individuals’  psychomotor  performance  as  psychomotor  tasks  are  practiced  and 
learned.  The  discussion  of  the  utility  of  psychomotor  tests  covers  issues 
such  as  administrative  efficiency  and  cost-effectiveness.  Different  test 
administration  formats  are  examined  (e.g.,  paper-and-pencil  measures,  ap¬ 
paratus  measures,  and  computerized  measures).  Also,  the  intercorrelations 
between  psychomotor  and  cognitive-perceptual  abilities  are  presented  in 
order  to  determine  the  magnitude  of  the  unique  variance  which  psychomotor 
measures  might  contribute  to  the  prediction  of  job  performance  and  related 
criteria.  With  respect  to  group  differences,  the  evidence  is  scant.  There 
Is  a  brief  review  of  the  few  studies  which  compare  the  validity  of  psycho¬ 
motor  measures  for  blacks  and  whites.  In  addition,  two  validity  studies 
conducted  with  samples  of  foreign  student  pilots  are  described.  Finally  in 
this  section,  there  is  an  examination  of  the  controversy  surrounding  multi¬ 
ple-construct  psychomotor  predictor  measures  and  the  practical  utility  of 
such  measures  in  large-scale  selection  research. 
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The  last  section  of  this  report  contains  a  summary  of  the  previous 
sections.  The  focus  of  this  summary  Is  the  Implications  of  the  psychomotor 
literature  for  the  ongoing  Army  selection  and  classification  research 
project.  Recommendations  are  made  In  light  of  the  goals  and  objectives  of 
this  research  effort. 
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SECTION  II 


A  TAXONOMY  OF  THE  PSYCHOMOTOR  ABILITY  DOMAIN 

Historical  Overview 

Much  of  what  we  know  about  the  taxonomy  of  the  psychomotor  ability 
domain  Is  based  on  a  series  of  studies  of  airmen  and  basic  trainee  airmen 
by  Fleishman  and  his  associates.  Most  of  this  research  is  actually  quite 
recent.  Even  though  psychologists  have  been  administering  psychomotor 
tests  since  the  1890s  and  have  been  using  psychomotor  tests  to  predict 
pilot  training  success  and  job  performance  since  before  World  War  I,  it  was 
not  until  1953  that  Fleishman  began  a  systematic  Investigation  of  the 
structure  of  the  abilities  comprising  the  psychomotor  domain.  This  is 
extremely  slow  progress  In  comparison  with  the  progress  made  by  psycholo¬ 
gists  Interested  In  the  taxonomy  of  cognitive-perceptual  abilities  (e.g., 
Spearman,  1927;  Thurstone,  1938). 

Passey  and  McLaurin  (1966)  have  reviewed  the  literature  on  early 
aircrew  selection.  Their  description  of  the  research  conducted  prior  to 
World  War  II  suggests  a  lack  of  concern  for  either  predictor  or  construct 
explication.  On  the  criterion  side,  Passey  and  McLaurin  report  that  there 
was  very  little  research  Into  what  a  pilot  actually  does  on  the  job.  It 
was  not  until  1941  that  psychologists  attempted  any  sort  of  a  systematic 
job  analysis  for  pilots  (Miller,  1947).  This  basic  job  analytic  work 
enabled  Army  Air  Forces  (AAF)  psychologists  to  make  tremendous  strides  in 
psychomotor  predictor  development  during  World  War  II.  Whereas  previous 
selection  of  psychomotor  predictors  had  relied  heavily  on  arm  chair  analy¬ 
sis  concerning  the  appropriate  psychomotor  predictors  for  different  jobs 
(cf.  Vlteles,  1942),  AAF  psychologists  such  as  Melton  (1947)  and  Miller  now 
had  the  job  analysis  Information  needed  to  make  a  more  objective  deter¬ 
mination  of  the  psychomotor  abilities  and  tests  which  might  be  relevant  to 
pilot  performance.  Melton,  In  particular,  was  instrumental  in  developing  a 
number  of  new  apparatus  tests.  These  tests,  which  became  part  of  the 
Aircrew  Classification  Battery  (ACB),  were  modified  and  Improved  frequently 
throughout  the  course  of  the  war. 

In  the  Immediate  postwar  period,  several  events  slowed  the  progress  of 
psychomotor  research.  First,  the  end  of  the  war  greatly  reduced  the  need 
for  military  manpower  and  personnel  research.  In  addition,  the  separation 
of  the  Air  Force  from  the  Army  fragmented  the  aggressive  research  efforts 
begun  during  the  war. 

Nevertheless,  the  Army,  the  Air  Force,  and  the  Navy  continued  to 
collect  important  data.  These  data  included  cognitive-perceptual  measures 
as  well  as  measures  from  psychomotor  paper-and-pencil  and  apparatus  tests. 
The  accumulation  of  large  data  bases  allowed  researchers  to  use  multivar¬ 
iate  statistical  techniques  to  Identify  the  underlying  factor  structure  of 
aircrew  selection  test  batteries.  Guilford  and  Lacey  (1947),  Dudek  (1948, 
1949),  Michael  (1949),  and  Roff  (1951)  carried  out  factor  analyses  based  on 
data  collected  during  the  war,  while  Roff  (1953)  factor  analyzed  data 
collected  immediately  following  the  war  in  a  joint  Navy-Air  Force  study. 

At  the  same  time,  a  body  of  factor  analytic  studies  of  psychomotor 
test  batteries  was  accumulating  from  nonmilitary  research  (e.g.,  Harrell, 
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1940;  Seashore,  Buxton,  &  McCollom,  1940;  Wittenborn,  1945).  There  were  a 
number  of  similarities  between  the  factors  Identified  in  these  nonmilitary 
studies  and  those  Identified  In  the  military  studies.  The  similarities 
were  first  summarized  In  a  literature  review  by  Fleishman  (1953). 

Fleishman  Identified  the  following  10  psychomotor  factors  as  prime  candi¬ 
dates  for  future  research: 


1.  Reaction  Time.  The  speed  with  which  a  subject  can  make  a  simple, 
predetermined  response  upon  presentation  of  a  stimulus. 

2.  Tapping.  The  speed  with  which  a  subject  can  oscillate  his 
fingers  or  his  arm,  independent  of  any  eye-hand  coordination. 

3.  Psvchomotor  Coordination.  The  integration  of  muscle  movements 
and/or  the  coordination  between  eye  and  muscle  movements. 

4.  Manual  Dexterity.  The  ability  to  make  skillful,  speeded  arm  or 
hand  movements. 

5.  Finger  Dexterity.  The  ability  to  make  skillful,  fine,  speeded 
object  manipulations  with  the  fingers. 

6.  Psvchomotor  Precision.  An  ambiguously  defined  ability  which 
involves  speeded,  fine  object  manipulations  with  the  fingers,  but 
seemingly  embraces  more  eye-hand  coordination  than  finger  dex¬ 
terity. 

7.  Steadiness.  The  ability  to  make  extremely  coordinated,  accurate 
movements  in  which  the  need  for  speed  and  strength  are  minimized. 

8.  Motor  Kinesthesis.  The  ability  to  make  a  compensatory  motor 
response  in  order  to  keep  a  unit  balanced  in  a  given  position. 

9.  Aiming.  The  ability  to  execute  a  series  of  movements  requiring 
eye-hand  coordination. 

10.  Ambidexterity.  The  ability  to  perform  rapid,  fairly  accurate 
movements  with  one’s  non-preferred  hand. 


In  the  sections  that  follow,  the  psychomotor  ability  factors  above  and 
additional  psychomotor  ability  factors  identified  in  subsequent  research 
are  discussed  and  described  in  terms  of  the  psychomotor  tests  which  load 
and  define  the  ability  factors.  To  facilitate  understanding  of  these 
psychomotor  tests,  brief  summaries  of  many  of  the  most  commonly  used  psy¬ 
chomotor  tests  are  provided  in  Appendix  A.  The  summaries  include:  the  name 
of  the  test;  the  name  and/or  description  of  the  ability  construct  which  the 
test  measures;  a  brief  description  of  the  task  which  the  subject  is  re¬ 
quired  to  perform  for  the  test,  generally  accompanied  by  a  picture  of  the 
apparatus  for  apparatus  tests  or  a  sample  of  test  items  for  paper-and- 
pencil  tests;  a  summary  of  administration  and  scoring  instructions;  and  a 
summary  of  the  reliability  and  validity  evidence  for  the  test. 
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HelshmarTs  Factor  Analytic  Work 


During  the  mid  and  ‘late  1950s,  Fleishman  and  his  associates  continued 
their  research  into  the  taxonomy  of  the  psychomotor  ability  domain. 
Fleishman  (1967)  has  stated  that  the  primary  impetus  for  these  investiga¬ 
tions  was  to  uncover  the  ability  factors  common  to  both  psychomotor  tests 
and  pilot  performance.  Fleishman  found  it  at  least  somewhat  surprising 
that  many  of  the  valid  psychomotor  predictors  of  pilot  performance  bore 
little  superficial  resemblance  to  the  tasks  pilots  performed  on  the  job.  A 
prime  example  was  the  Rotary  Pursuit  Test  (see  Appendix  A).  A  subject’s 
task  in  this  test  Is  to  keep  a  metal  stylus  In  contact  with  a  small  metal ic 
target  which  Is  set  on  a  rotating  disk.  The  disk  resembles  a  phonograph 
turntable  In  both  appearance  and  operation.  Fleishman  (1954b)  reported  a 
validity  of  .27  for  the  Rotary  Pursuit  Test  as  a  predictor  of  graduation 
from  pilot  training.  This  and  other  correlations  between  psychomotor  tests 
and  pilot  performance  Indicated  to  Fleishman  that  there  must  be  a  common 
set  of  abilities  underlying  performance  on  both  the  tests  and  the  pilot’s 
job. 


Fleishman  was  certainly  not  the  first  who  sought  to  identify  the 
abilities  underlying  psychomotor  and  pilot  performance.  As  was  noted 
above,  before  Melton,  Miller,  and  their  colleagues  designed  the  Airman 
Classification  Battery  (ACB)  psychomotor  tests  during  World  War  II,  they 
attempted  to  do  a  careful  analysis  of  pilots’  jobs  in  order  to  Identify  the 
abilities  contributing  to  successful  pilot  performance.  They  then  con¬ 
structed  their  new  psychomotor  apparatus  tests  specifically  to  assess  these 
hypothesized  underlying  abilities.  Fleishman  was  simply  extending  the  work 
of  Melton  and  Miller  by  using  factor  analysis  to  Identify  and  define  these 
psychomotor  abilities  more  precisely.  In  these  efforts,  Fleishman  was 
greatly  aided  by  the  research  which  preceded  his.  For  example,  his  1953 
review  of  previous  factor  analytic  research  cited  ten  previously  suggested 
factors  meriting  further  study.  Fleishman  also  drew  heavily  upon  the 
apparatus  tests  of  the  ACB  (Melton,  1947)  In  the  conduct  of  his  research. 

Between  1953  and  1962,  Fleishman  authored  or  co-authored  numerous 
studies  on  the  taxonomy  of  the  psychomotor  ability  domain.  In  almost  all 
of  these  studies,  Fleishman  administered  a  large  battery  of  psychomotor 
tests  (usually  both  apparatus  and  paper-and-pencil  tests)  to  a  sample  of 
pilot  or  airmen  trainees.  In  some  of  these  studies,  the  psychomotor  tests 
were  supplemented  by  a  battery  of  physical  performance  or  cognitive- 
perceptual  ability  tests.  Fleishman  typically  factor  analyzed  the  correla¬ 
tion  matrix  for  the  test  battery  using  Thurstone’s  (1947)  centroid  method. 
The  resulting  factor  pattern  matrix  was  then  rotated  to  achieve  simple 
structure.  In  the  following  paragraphs,  eight  such  studies  are  summarized. 

Fleishman  and  Hempel  (1954a)  administered  a  battery  of  15  dexterity 
tests  to  a  sample  of  400  basic  trainee  airmen.  These  tests  Included  a 
number  of  widely  used  apparatus  tests  (e.g.,  O’Connor  Finger  Dexterity, 
Purdue  Pegboard,  Minnesota  Rate  of  Manipulation,  and  Santa  Ana  Finger 
Dexterity)  as  well  as  several  paper-and-pencil  tests  designed  to  measure  a 
subject's  speed  and  accuracy  In  making  marks  inside  figures  and  tracing 
lines  (e.g.,  Large  Tapping,  Small  Tapping,  Tracing,  and  Square  Marking). 
Fleishman  and  Hempel ’s  goal  was  to  identify  the  factors  underlying  manipu¬ 
lative  performance.  Via  factor  analysis,  they  were  able  to  extract  and 
identify  five  ability  factors:  finger  dexterity,  manual  dexterity, 
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wrist-finger  speed,  aiming,  and  positioning.  Three  of  these  abilities-- 
finger  dexterity,  manual  dexterity,  and  a1ming--corresponded  exactly  to 
factors  Fleishman  (1953)  had  identified  in  his  literature  review.  A  fourth 
factor,  wrist-finger  speed,  was  just  a  new  name  for  the  factor  that 
Fleishman  (1953)  had  previously  called  tapping.  The  fifth  factor,  posi¬ 
tioning,  was  the  least  clearly  defined.  All  of  the  tests  loading  on  the 
positioning  factor  require  subjects  to  place  or  position  blocks,  pegs,  or 
pins  Into  snugly  fitting  holes  In  an  apparatus  board  as  quickly  as  pos¬ 
sible.  Fleishman  and  Hempel  (1954a)  noted  that  this  was  somewhat  similar 
to  aiming,  which  Involves  making  rapid,  precise  pencil  marks  Inside  a 
series  of  small  circles.  Still,  they  were  unable  to  specify  precisely  the 
unique  ability  represented  by  this  factor. 

In  a  much  larger  study,  Fleishman  (1954a)  administered  27  apparatus 
tests  and  11  printed  tests  to  a  sample  of  400  basic  trainee  airmen. 
Fleishman’s  goal  was  "to  verify  empirically  in  a  single  study  practically 
all  of  the  psychomotor  factors  previously  Identified  In  the  separate 
studies,  together  with  any  new  factors  that  might  emerge"  (p.  438).  The 
test  battery  that  Fleishman  used  included  a  number  of  AAF  apparatus  tests 
used  durlnq  World  War  II  (Melton,  1947)  as  well  as  the  dexterity  tasts  used 
in  the  Fleishman  and  Hempel  (1954a)  study  cited  above.  Of  the  nine  well- 
defined  factors  which  emerged  from  the  factor  analysis,  seven  had  been 
Identified  previously:  wrist-finger  speed,  finger  dexterity,  aiming,  arm- 
hand  steadiness,  reaction  time,  manual  dexterity,  and  psychomotor  coordina¬ 
tion.  The  remaining  two  factors  Fleishman  called  rate  of  arm  movement  and 
spatial  relations.  The  tests  defining  rate  of  arm  movement  Included  Ten 
Target  Aiming,  Two-Plate  Tapping,  and  Rotary  Aiming.  All  three  tests 
require  subjects  to  make  gross,  rapid  arm  movements.  The  Importance  of 
accuracy  Is  reduced  or  minimized  in  all  of  these  tests.  Tests  loading  the 
spatial  relations  factor  Included  Discrimination  Reaction  Time  and  Complex 
Coordination.  Both  of  these  tests  require  subjects  to  determine  the  ap¬ 
propriate  spatial  direction  for  their  responses  In  accordance  with  the 
stimulus  presented. 

Hempe1  and  Fleishman  (1955)  subsequently  factor  analyzed  a  battery  of 
46  manipulative,  paper-and-pencil ,  and  physical  performance  tests  adminis¬ 
tered  to  a  sample  of  400  basic  trainee  airmen.  The  17  manipulative  tests 
Included  a  number  of  placement,  assembly,  and  turning  tasks.  Most  of  the 
six  printed  tests  required  the  subject  to  draw  lines  or  place  dots  or  X’s 
Inside  figures.  The  remaining  23  physical  performance  tests  were  assess¬ 
ments  of  physical  fitness  (e.g.,  number  of  chin-ups  completed  in  a  fixed 
period  of  time).  Hempel  and  Fleishman  failed  to  indicate  why  they  at¬ 
tempted  to  factor  analyze  such  a  mixed  lot  of  tests.  Not  surprisingly,  the 
physical  performance  tests  did  not  load  on  the  same  factors  as  the  manipu¬ 
lative  and  paper-and-pencil  tests.  Of  the  14  interpretable  factors,  only 
four  were  relevant  to  the  psychomotor  domain:  aiming,  arm-hand  steadiness, 
manual  dexterity,  and  finger  dexterity.  All  four  of  these  ability  factors 
had  emerged  In  previous  factor  analytic  studies. 

A  battery  of  11  apparatus  tests  and  nine  paper-and-pencil  tests  was 
administered  by  Fleishman  and  Hempel  (1955)  to  a  sample  of  264  basic  trai¬ 
nee  airmen.  The  apparatus  tests  Included  a  discrimination  reaction  time 
test,  four  simple  reaction  time  tests,  two  dexterity  tests,  two  complex 
psychomotor  tests  taken  from  the  ACB,  and  two  speed  of  arm  movement  tests. 
The  nine  printed  tests  Included  a  vocabulary  test,  a  current  affairs  test, 
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three  tests  related  to  mechanical  knowledge,  and  four  tests  of  perceptual  - 
visual  ability.  Nine  factors  emerged  from  a  factor  analysis  of  this  test 
battery.  Fleishman  and  Hempel  regarded  six  of  these  as  psychomotor  ability 
factors:  discrimination  reaction  time,  reaction  time,  psychomotor  coordina¬ 
tion,  rate  of  arm  movement,  finger  dexterity,  and  spatial  relations.  The 
discrimination  reaction  time  factor  was  test-specific;  the  only  variables 
loading  on  this  factor  we»*e  scores  taken  from  the  Discrimination  Reaction 
Time  Test.  The  remaining  five  factors  have  all  been  Identified  and  de¬ 
scribed  previously. 

In  1947-1948,  over  1,000  Navy  pilot  candidates  completed  23  paper-and- 
pencll  and  apparatus  psychomotor  tests  as  part  of  a  joint  Navy-Air  Force 
research  project  noted  briefly  above  (Roff,  1953).  The  16  apparatus  tests 
were  drawn  primarily  from  the  tests  used  by  the  AAF  during  World  War  II, 
while  the  seven  printed  tests  were  Intended  to  measure  marking  speed  and 
visualization  abilities.  Fleishman  and  Hempel  (1956)  obtained  the  data  for 
these  subjects  and  submitted  the  data  to  a  factor  analysis.  The  results  of 
the  factor  analysis  were  particularly  Interesting  because  this  was  the 
largest  and  most  diverse  set  of  psychomotor  apparatus  tests  which  had  ever 
been  Included  In  a  factor  analysis.  Nine  factors  emerged:  two  psychomotor 
coordination  factors,  two  spatial  relations  factors,  Integration,  rate 
control,  perceptual  speed,  manual  dexterity,  and  visualization. 

Fleishman  and  Hempel  (1956)  Indicated  that  the  first  psychomotor 
coordination  factor  was  the  ability  to  perform  "muscular  movements  Involved 
In  making  fine,  accurate  (control)  adjustments”  (p.  100).  The  tests  which 
loaded  most  highly  on  this  factor  Included  Complex  Coordination,  Pursuit 
Confusion,  Rotary  Pursuit,  Two-Hand  Coordination,  and  Rudder  Control.  The 
tests  loading  most  highly  on  the  second  psychomotor  coordination  factor 
were  Rudder  Control,  Plane  Control,  Multidimensional  Pursuit,  Complex  Co¬ 
ordination,  and  Two-Hand  Coordination.  All  of  these  tests  Involve  the 
ability  to  coordinate  the  movement  of  two  limbs. 

The  tests  loading  on  the  first  spatial  relations  factor  all  use  com¬ 
plex  stimulus  patterns  to  cue  the  subject.  The  subject  Is  then  required  to 
Interpret  some  spatial  characteristic  of  the  stimulus  pattern  (e.g.,  the 
subject  might  have  to  judge  his  body  orientation  in  relation  to  the  orien¬ 
tation  of  the  stimulus).  After  Interpreting  this  spatial  characteristic, 
the  subject  must  make  an  appropriate  response.  According  to  Fleishman  and 
Hempel,  the  most  difficult  aspect  of  this  task  Is  the  Interpretation  of  the 
stimulus  pattern.  By  comparison,  the  determination  and  execution  of  the 
appropriate  response  are  relatively  simple.  The  tests  loading  on  this 
first  spatial  relations  factor  Included  Controls  Orientation,  Drift  Correc¬ 
tion,  Direction  Control,  and  Directional  Control.  This  first  spatial 
relations  factor  contrasted  sharply  with  the  second  spatial  relations 
factor,  which  Fleishman  and  Hempel  called  response  orientation.  The  second 
spatial  relations  factor  was  quite  similar  to  the  spatial  relations  factor 
which  Fleishman  and  his  colleagues  had  Identified  In  two  previous  studies 
(Fleishman,  1954a;  Fleishman  &  Hempel,  1955).  The  tests  loading  on  this 
factor  included  Signal  Discrimination,  Discrimination  Reaction  Time,  and 
Complex  Multiple  Reaction.  All  of  these  tests  require  subjects  to  quickly 
choose  and  execute  the  correct  response  upon  presentation  of  the  stimulus. 
Generally,  for  these  tests  the  subject  Is  presented  with  a  rather  simple 
stimulus  (e.g.,  a  single  light).  The  subject  Is  then  required  to  determine 
the  appropriate  response  from  among  a  number  of  response  alternatives.  For 
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some  tests,  there  Is  a  complex  spatial  relationship  between  the  location  of 
the  stimulus  and  the  location  or  direction  of  the  appropriate  response. 

For  other  tests,  a  complex  series  of  spatially  related  responses  are  re¬ 
quired  upon  presentation  of  the  stimulus.  Thus,  while  the  focus  of  com¬ 
plexity  for  the  first  spatial  relations  factor  was  on  the  stimulus,  the 
focus  of  complexity  for  the  second  spatial  relations  factor  was  on  the 
response. 

The  other  two  psychomotor  abilities  which  were  identified  In  this 
study  were  rate  control  and  manual  dexterity.  The  tests  loading  on  the 
rate  control  factor  Included  Rate  Control,  Single-Dimension  Pursultmeter, 
and  Compensatory  Balance.  These  tests  all  require  subjects  to  make  antici- 
pitory  control  adjustments  In  response  to  changes  in  the  speed  and  direc¬ 
tion  of  a  continuously  moving  stimulus.  The  manual  dexterity  factor  had 
been  identified  and  defined  by  Fleishman  and  his  associates  in  a  number  of 
previous  studies. 

The  remaining  three  factors  identified  by  Fleishman  and  Hempel -- 
integration,  perceptual  speed,  and  visual ization--were  cognitive-perceptual 
ability  factors. 

This  factor  analytic  study  was  particularly  important  because  It 
suggested  several  new  psychomotor  ability  factors.  First,  two  psychomotor 
coordination  factors  emerged  from  the  analysis.  The  first  was  concerned 
with  highly  controlled  movements  and  adjustments,  while  the  second  focused 
on  the  ability  to  coordinate  the  movements  of  two  or  more  limbs.  A  new, 
tentative  spatial  relations  ability  factor  was  also  identified.  The  focus 
of  this  spatial  relations  factor  was  Identifying  and  Interpreting  a  spatial 
characteristic  embedded  within  a  complex  stimulus  pattern.  In  addition, 
the  focus  of  the  second  spatial  relations  factor  was  more  clearly  specified 
as  being  the  ability  to  choose  and  execute  the  correct  response  alternative 
upon  presentation  of  a  relatively  simple  stimulus.  Finally,  a  rate  control 
ability  factor  emerged.  This  factor  was  defined  by  tests  requiring  the 
subject  to  make  continuous  anticipatory  motor  judgments  In  response  to  a 
moving  stimulus. 

In  a  follow-up  to  this  study,  Fleishman  (1958)  administered  a  battery 
of  31  psychomotor  apparatus  tests  to  a  sample  of  204  basic  trainee  airmen. 
Seven  interpretable  factors  emerged  from  the  factor  analysis:  response 
orientation  (corresponding  to  the  second  spatial  relations  factor  from 
Fleishman  &  Hempel,  1956),  fine  control  sensitivity  (corresponding  to  the 
first  psychomotor  coordination  factor  from  Fleishman  &  Hempel),  reaction 
time,  speed  of  arm  movement  (previously  called  rate  of  arm  movement),  arm- 
hand  steadiness,  multilimb  coordination  (corresponding  to  the  second  psy¬ 
chomotor  coordination  factor  from  Fleishman  &  Hempel),  and  rate  control. 
This  study  helped  confirm  many  of  the  psychomotor  ability  factors  first 
identified  by  Fleishman  and  Hempel. 

A  study  by  Parker  and  Fleishman  (1960)  further  confirmed  the  taxonomy 
of  psychomotor  ability  factors  identified  by  Fleishman  and  Hempel  (1956) 
and  Fleishman  (1958).  A  total  of  203  Air  Force  ROTC  students  at  a  large 
eastern  university  completed  29  psychomotor  apparatus  tests  and  21  psycho¬ 
motor  and  perceptual  paper-and-pencil  tests.  Eight  of  the  15  factors  which 
emerged  from  a  factor  analysis  were  defined  primarily  by  psychomotor  tests: 
control  precision  (corresponding  to  the  first  psychomotor  coordination 
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factor  from  Fleishman  &  Hempel,  and  the  fine  control  sensitivity  factor 
from  Fleishman,  1958),  speed  of  arm  movement,  manual  dexterity,  reaction 
time,  response  orientation,  arm-hand  steadiness,  finger  dexterity,  and 
multilimb  coordination. 

In  the  most  recent  factor  analytic  study  of  the  psychomotor  domain  by 
Fleishman  and  his  associates,  Fleishman  and  Ellison  (1962)  administered  a 
battery  of  22  fine  manipulative  and  dexterity  tests  to  760  airmen  who  were 
entering  one  of  three  technical  schools:  engine  mechanic,  hydraulic  mechan¬ 
ic,  and  aircraft  electrician.  The  battery  Included  nine  paper-and-pencll 
tests  and  13  apparatus  tests.  Five  Interpretable  factors  emerged  from  a 
factor  analysis:  wrist-finger  speed,  manual  dexterity,  finger  dexterity, 
aiming,  and  speed  of  arm  movement.  All  five  of  these  factors  had  emerged 
In  at  least  one  previous  study  (Fleishman,  1954a). 

Table  1  provides  a  list  of  the  psychomotor  ability  factors  which 
emerged  from  each  of  the  eight  factor  analytic  studies  summarized  above. 

Of  the  13  factors  listed,  all  but  the  positioning  factor  and  the  "stimulus" 
spatial  relations  factor  (called  simply  Spatial  Relations  In  Table  1)  were 
Identified  In  at  least  two  studies.  Fleishman  (1967)  maintained  that  this 
was  strong  evidence  for  a  psychomotor  ability  taxonomy  consisting  of  11 
ability  factors.  Table  2  contains  a  list  of  these  11  abilities.  Table  2 
also  lists  marker  tests  for  each  ability  (Appendix  A  contains  a  description 
of  most  of  these  marker  tests)  and  provides  references  for  some  of  the 
factor  analytic  studies  In  which  the  ability  has  been  Identified. 

Further  evidence  for  the  validity  of  portions  of  this  taxonomy  Is 
provided  by  a  study  of  pilot  performance  (Fleishman  &  Ornsteln,  1960).  For 
this  study,  63  graduates  of  an  Air  Force  pilot  school  were  required  to 
perform  a  series  of  24  flight  maneuvers.  These  24  maneuvers  had  been 
chosen  to  be  representative  of  the  total  range  of  nonacrobatlc  maneuvers 
taught  during  pilot  training.  For  each  maneuver,  a  Dally  Progress  Record 
Sheet  (DPRS)  had  been  prepared  (Sutter,  Townsend,  &  Ornsteln,  1954).  The 
DPRS  for  each  maneuver  contained  a  list  of  Items  which  had  been  judged  to 
be  essential  to  proper  execution  of  that  maneuver.  As  each  pilot  executed 
each  maneuver,  a  trained  rater  marked  the  DPRS  to  Indicate  any  errors  made 
by  the  pilot.  The  pilot’s  score  on  the  maneuver  was  the  total  number  of 
errors  he  made  on  the  Individual  Items.  Each  pilot  was  rated  four  times  on 
each  maneuver.  Fleishman  and  Ornsteln  subjected  the  error  scores  to  a 
factor  analysis  In  order  to  Identify  the  factors  underlying  pilot  perfor¬ 
mance.  Six  factors  emerged  from  the  analysis.  Four  of  these  performance 
factors  paralleled  ability  factors  previously  identified  by  Fleishman  and 
his  associates:  control  precision,  multllimb  coordination,  response  orien¬ 
tation,  and  rate  control.  The  other  two  factors,  spatial  orientation  and 
kinesthetic  discrimination,  were  similar  to  previously  identified  per¬ 
ceptual  factors. 

Criticisms  of  Fleishman’s  Psychomotor  Ability  Taxonomy 

Fleishman’s  taxonomy  of  psychomotor  abilities  has  been  criticized  by  a 
number  of  researchers  over  the  past  20  years.  The  criticisms  center  mainly 
on  his  use  of  factor  analysis  to  identify  ability  dimensions. 

Factor  Analysis  vs.  Molar  Correlational  Analysis.  Jones  (1960,  1962) 
has  been  highly  critical  of  the  use  of  factor  analysis  to  Identify  factors 
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Aiming  The  ability  to  make  very  precise,  accu-  Small  Tapping  Test  Fleishman  l  Hempel 

rate  movements  under  highly  speeded  Trace  Tapping  II  1954a 

conditions.  This  ability  is  dependent  Fleishman,  1954a 

jpon  very  precise  eye-hand  coordination.  Hempel  t  Fleishman 

1955 
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underlying  the  pattern  of  correlations  in  a  correlation  matrix.  According 
to  Jones,  factor  analysis  is  an  inductive  procedure.  Factor  analysis 
permits  no  tests  of  theories  or  hypotheses.  Rather,  psychologists  tend  to 
execute  factor  analyses  in  a  mechanical,  unthinking  fashion,  according  to 
Jones.  They  enter  the  raw  data  into  a  computer,  compute  the  correlations 
among  the  variables,  extract  and  rotate  factors  according  to  their  favorite 
extraction  and  rotation  algorithms,  and  then  examine  the  resulting  factor 
pattern  matrix  to  identify  the  underlying  factors. 

Jones  objects  to  this  procedure  on  two  grounds.  First,  he  claims  that 
factor  analysis  does  not  allow  researchers  to  test  a  priori  theories  re¬ 
garding  the  underlying  factor  structure  of  a  set  of  variables.  Jones 
prefers  a  more  deductive,  theory- testing  approach  to  psychology.  Second, 
Jones  believes  that  one  can  learn  a  great  deal  about  the  factor  structure 
of  a  set  of  variables  simply  by  examining  the  pattern  of  correlations  in  a 
correlation  matrix. 

Based  on  his  Investigations,  Jones  (1960,  1962)  concluded  that  the 
pattern  of  correlations  within  a  correlation  matrix  typically  assumes  one 
of  a  relatively  limited  number  of  common  patterns  (e.g.,  simplex  form,  a 
circumplex  form,  monotonic  hierarchy,  rippled  patterns,  tiered  patterns, 
superdiagonal  form,  etc.).  These  patterns  are  Important  because  they  tell 
a  lot  about  the  relationships  between  and  among  factors  and  variables.  The 
patterns  may  suggest  or  reveal  cumulative,  competitive,  developmental, 
and/or  maturational  relationships,  for  Instance.  This  Information  is 
largely  lost  to  the  factor  analyst,  according  to  Jones. 

As  an  alternative  to  factor  analysis,  Jones  recommends  that  resear¬ 
chers  begin  by  developing  a  theory  about  the  factor  structure  underlying 
their  set  of  variables.  Hypotheses  regarding  the  factor  structure  would 
likely  focus  on  the  number  and  possible  nature  of  the  factors,  the  particu¬ 
lar  sources  of  variance  (i.e.,  common  factors)  shared  by  subsets  of  vari¬ 
ables,  and  the  particular  group  of  factors  comprising  each  variable.  Theo¬ 
ries  could  then  be  altered  as  needed  according  to  the  observed  pattern  of 
correlations  in  <he  correlation  matrix.  Jones  suggests  techniques  for 
computing  the  factor  pattern  matrix  for  each  of  the  common  patterns  of 
correlations  he  has  identified.  While  the  factor  pattern  matrix  will 
typically  not  be  an  exact  representation  of  the  correlations  in  the  corre¬ 
lation  matirx,  Jones  (1960)  states  that  one  should  be  able  to  work  back¬ 
wards  and  use  the  factor  pattern  matrix  to  reproduce  closely  the  observed 
correlations  In  the  correlation  matrix.  For  example,  Jones’s  open  conti¬ 
guity  model  was  such  an  accurate  depiction  of  one  correlation  matrix  that 
no  observed  correlation  differed  by  more  than  .03  from  the  correlations  re¬ 
produced  from  the  factor  pattern  matrix. 

Evaluating  the  utility  of  molar  correlational  analysis  Is  somewhat 
difficult.  Jones  (1962)  was  correct  to  criticize  Fleishman  for  relying 
solely  on  Inductive  methods  of  identifying  factors.  In  practice,  however, 
Jone’s  approach  also  relies  quite  heavily  upon  inductive  methods.  In 
almost  all  of  the  examples  in  his  1960  monograph,  Jones  proposed  a  model  to 
explain  the  interrelationships  among  a  set  of  variables  only  after  he 
examined  the  correlation  matrix.  This  is  not  unlike  the  approach  of  most 
factor  analysts,  who  also  rely  on  the  pattern  of  correlations  to  determine 
the  underlying  factor  structure  of  their  variables.  One  might  even  say 
that  factor  analysts  are  a  bit  more  objective  in  their  approach  to 
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Identifying  the  factor  structure.  Factor  analysts  allow  the  computer  to 
"choose"  a  structure  for  them  based  on  a  predetermined  set  of  decision 
rules.  Molar  analysts,  on  the  other  hand,  subjectively  choose  a  model 
based  on  their  perception  of  the  pattern  of  correlations  In  the  correlation 
matrix.  There  would  seem  to  be  more  margin  for  error  and  unreliability 
with  such  an  approach. 

This  is  not  to  say  that  there  are  no  potential  advantages  to  Jones's 
methods.  At  least  two  are  readily  apparent. 

First,  molar  correlational  analysis  would  indeed  permit  researchers  to 
test  their  theories  regarding  the  factor  structure  of  a  set  of  variables. 
All  researchers  would  need  to  do  would  be  to  specify  a  priori  the  structure 
of  their  data  using  one  of  Jones's  models  ai.d  then  determine  how  well  the 
computed  factor  pattern  matrix  accounted  for  the  observed  correlations  in 
the  correlation  matrix.  Unfortunately,  molar  correlational  analysis  pro¬ 
vides  no  goodness-of-fit  statistics  for  testing  the  correspondence  between 
a  data  set  and  a  theory,  and  Jones  offers  only  minimal  guidance  for  de¬ 
termining  whether  or  not  a  factor  model  accurately  depicts  the  observed 
data.  Further,  there  have  been  no  studies  directed  at  the  question  of 
whether  two  or  more  different  factor  models  might  be  able  to  account  equal¬ 
ly  well  for  the  same  data  set.  No  one  has  shown,  for  example,  that  the 
observed  correlations  in  a  correlation  matrix  in  superdiagonal  form  could 
not  be  reproduced  equally  well  by  both  an  open  contiguity  structure  and  a 
simplex  (two  of  the  models  described  by  Jones). 

A  second  apparent  advantage  of  molar  correlational  analysis  Is  that  it 
suggests  the  need  to  look  for  more  complex  relationships  among  variables 
and  factors  than  factor  analysts  are  typically  likely  to  do.  Most  factor 
analysts  continue  to  pursue  Thurstone’s  (1947)  goal  of  simple  structure 
(l.e.,  all  variables  load  on  at  most  two  or  three  factors  and  all  factors 
are  defined  by  a  limited  subset  of  variables).  Jones  presents  a  number  of 
interesting  examples  where  the  principles  of  simple  structure  simply  are 
not  appropriate  for  the  data  under  consideration.  For  example,  he  demon¬ 
strates  that  In  a  typical  study  involving  practice,  performance  on  early 
trials  Is  determined  by  a  number  of  factors  whereas  performance  on  later 
trials  might  be  determined  by  only  one  or  two  factors  (Jones,  1962).  While 
this  does  not  apply  directly  to  Fleishman’s  taxonomoic  research,  it  does 
suggest  Instances  where  traditional  factor  analysis  and  rotation  to  simple 
structure  are  Inappropriate.  Stated  simply,  one  must  consider  carefully 
the  likely  interrelationships  among  the  factors  and  variables  and  choose 
the  method  of  data  analysis  which  is  most  appropriate  for  testing  that 
hypothesized  state. 

An  important  reason  for  discounting  many  of  Jones’s  criticisms,  how¬ 
ever,  Is  that  there  have  been  a  number  of  advances  in  structural  modeling 
and  confirmatory  factor  analysis  over  the  past  20  years  (e.g.,  Jbreskog, 
1978}  which  would  seem  to  be  better  suited  than  molar  correlational  analy¬ 
sis  for  answering  the  structural  questions  which  Jones  was  posing.  It  is 
perhaps  unfair  to  criticize  Jones  for  failing  to  recommend  statistical 
methodologies  which  were  not  to  be  explicated  and  popularized  until  10-15 
years  after  his  1960  monograph.  Yet,  critics  of  Fleishman  often  cite 
Jone's  reanalysls  of  Fleishman  and  Hempel’s  (1954b)  data  in  criticizing 
Fleishman’s  psychomotor  taxonomy.  Instead,  the  consistency  in  Fleishman’s 
results  (see  Table  1)  suggests  that  Fleishman’s  taxonomy  would  fare  quite 
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favorably  if  his  data  were  subjected  to  a  confirmatory  factor  analysis. 

The  Relationship  between  Factors  and  Abilities.  The  second  major 
criticism  of  Fleishman’s  psychomotor  ability  taxonomy  centers  on 
Fleishman’s  interpretation  of  the  nature  of  the  factors  emerging  from  his 
analyses. 

Fleishman  (1975)  has  maintained  that  factor  analysis  is  an  appropriate 
method  of  identifying  the  abilities  underlying  performance  on  a  series  of 
tests  or  tasks.  According  to  Fleishman,  if  two  tests  are  highly  cor¬ 
related,  it  must  be  because  a  common  ability  (or  set  of  abilities)  under¬ 
lies  performance  on  both  tests.  Therefore,  the  factors  emerging  from  a 
factor  analysis  of  psychomotor  tests  can  be  Interpreted  by  determining  the 
ability  common  to  performance  on  the  tests  loading  significantly  on  each 
factor.  Conversely,  each  factor  must  represent  some  real,  underlying 
ability. 

This  view  contrasts  sharply  with  the  opinions  of  several  other  trait 
researchers.  For  example,  Eysenck  has  defined  a  factor  as  "a  hypothetical 
[emphasis  added]  causal  Influence  underlying  and  determining  the  observed 
relationships  between  a  set  of  variables"  (Eysenck,  1953,  p.  108).  Ac¬ 
cording  to  Eysenck,  factors  are  useful  for  providing  a  unifying  structure 
for  viewing  the  world.  Factors  are  not  necessarily  real  entities.  Rather, 
as  with  any  scientific  law  or  postulate,  factors  are  merely  useful  devices 
for  making  sense  of  nature  (Thurstone,  1947).  This  position  is  similar  to 
Anastasl’s  (1983).  In  her  view,  the  primary  purpose  of  factors  is  to 
provide  a  classification  scheme  for  a  set  of  variables.  The  basis  of  this 
classification  scheme  Is,  of  course,  the  Intercorrelations  among  the  tests 
or  variables. 

To  understand  the  nature  of  a  factor,  then,  one  must  understand  (or  at 
least  postulate)  the  common  element  or  elements  linking  the  tests  or  vari¬ 
ables  comprising  the  factor.  Fleishman’s  position  was  that  the  common 
element  was  a  real,  underlying  ability.  Therefore,  based  on  his  factor 
analyses,  he  claimed  that  he  was  able  to  Identify  a  psychomotor  ability 
taxonomy.  Anastasi  (1983),  however,  claims  that  there  are  other  reasons 
that  tests  or  variables  might  be  correlated.  One  of  these  seems  partic¬ 
ularly  germane  to  Fleishman’s  analyses. 

Anastasi  states  that  one  reason  a  factor  might  emerge  is  because  of 
the  contiguity  or  co-occurrence  of  learning  experiences.  For  example,  one 
of  the  factors  which  emerged  in  the  Ohio  State  studies  of  leadership  was  a 
consideration  factor  (Halpin  &  Winer,  1957).  This  factor  was  comprised  of 
items  indicative  of  warmth,  trust,  friendship,  interpersonal  support,  and 
mutual  respect.  In  current  psychological  jargon,  we  would  probably  refer 
to  this  factor  as  interpersonal  skill.  Note,  however,  that  even  though 
this  skill  emerged  as  a  unitary  factor  from  a  factor  analysis,  it  is 
actually  comprised  of  several  distinct  elements.  For  example,  the  ability 
to  develop  warm  personal  relations  with  others  differs  from  the  ability  to 
earn  others’  respect.  These  two  abilities  would  seem  to  rely  on  slightly 
different  cognitive  processes  and  would  certainly  be  manifested  behavior- 
a’ly  in  quite  different  fashions.  Yet,  both  of  these  are  elements  of  the 
interpersonal  skill  factor.  The  most  likely  explanation  of  this  unitary 
factor  is  that  contingencies  on  interpersonal  behavior  operate  at  a  rather 
general  level.  In  any  given  social  situation,  an  individual  is  likely  to 


17 


have  to  draw  on  a  number  of  the  abilities  comprising  Interpersonal  skill. 

As  an  Individual's  behavior  In  Interpersonal  situations  Is  rewarded, 
punished,  and  shaped--depending  In  large  measure  upon  the  Individual’s 
current  standing  on  the  various  component  Interpersonal  ab11ities--a  more 
general  trait  or  ability  may  emerge.  Individuals  with  generally  high 
levels  of  Interpersonal  abilities  may,  for  example,  be  rewarded  for  their 
Interpersonal  behavior.  These  rewards  may  provide  them  with  the  opportuni¬ 
ties  and  confidence  to  practice  and  Improve  the  Interpersonal  abilities  In 
which  they  are  less  competent.  Conversely,  Individuals  who  are  adept  at 
only  a  very  few  Interpersonal  behaviors  may  garner  little  or  no  reinforce¬ 
ment  from  their  attempts  to  Interact  with  others.  As  they  withdraw  from 
Interpersonal  situations,  their  few  existing  Interpersonal  abilities  may 
"atrophy"  due  to  lack  of  practice.  Thus,  based  on  this  all-or-nothing 
reinforcement  pattern,  a  number  of  somewhat  disparate  abilities  may  merge 
and  appear  as  a  broader,  more  general  factor. 

It  Is  possible  that  the  co-occurrence  of  learning  experiences  could 
help  explain  some  of  the  factors  emerging  from  Fleishman’s  analyses.  For 
example,  performance  on  measures  of  rate  control  (e.g.,  Motor  Judgment 
Test,  Rate  Control  Test)  seems  to  depend  on  a  number  of  abilities,  in¬ 
cluding  spatial  orientation,  timing,  arm-hand  steadiness,  and  hand-wrist 
motor  coordination  skills.  Yet,  rate  control  emerges  as  a  single  factor. 

It  may  be  that  there  are  day-to-day  activities  performed  by  many  Indivi¬ 
duals  which  are  similar  to  the  tasks  performed  for  rate  control  tests,  and 
that  "practice"  on  these  rate  control -like  tasks  explains  how  these  dif¬ 
ferent  component  abilities  come  to  emerge  as  a  unitary  psychomotor  factor. 

Given  the  possibility  that  some  of  the  abilities  In  Fleishman’s  psy¬ 
chomotor  ability  taxonomy  may  be  comprised  of  several  more  specific  abili¬ 
ties,  one  might  wonder  whether  a  more  molecular  approach  to  the  psychomotor 
ability  taxonomy  would  be  most  appropriate.  For  example,  advocates  of  an 
information  processing  approach  to  psychomotor  assessment  argue  that  mea¬ 
surement  of  basic  Information  processing  abilities  holds  great  promise  for 
predicting  performance.  (See  the  summary  of  experimental  and  Information 
processing  research  on  psychomotor  skills  in  the  next  section.)  Implicit 
In  the  positions  of  many  Information  processing  researchers  Is  the  belief 
that  the  Information  processing  conceptualization  of  human  abilities  is  the 
most  theoretically  appropriate  approach  to  ability  assessment  because  the 
basic  Information  processing  abilities  serve  as  the  building  blocks  of  all 
of  the  more  complex  abilities  (Carroll,  1976).  Indeed,  many  Information 
processing  researchers  believe  that  some  day  they  will  be  able  to  explain 
all  complex  abilities  in  terms  of  their  component  information  processing 
abilities  (cf.  Sternberg,  1981). 

Still,  for  several  reasons,  Fleishman’s  psychomotor  ability  taxonomy 
seems  preferable  to  an  Information  processing  taxonomy  for  purposes  of  the 
current  selection  and  classification  research. 

first,  on  a  theoretical  level,  it  would  be  naive  to  say  that  the 
abilities  Fleishman  has  Identified  via  factor  analysis  are  somehow  theor¬ 
etically  deficient  simply  because  they  represent  a  complex  amalgamation  of 
a  number  of  more  basic  ability  processes.  Most  ma,jor  theories  of  cognitive 
abilities  posit  a  hierarchy  of  abilities  ranging  from  very  basic  processes 
such  as  encoding,  decoding,  and  storage  to  a  broad,  general  Intellectual 
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ability  factor  (l.e.,  g).  There  is  no  reason  to  suspect  that  psychomotor 
abilities  could  not  also  be  so  structure J.  For  example,  in  folk  parlance, 
people  are  often  described  as  "coordinated"  and  "handy"  or  "uncoordinated" 
and  "klutzy."  This  broad  psychomotor  dimension  seems  analogous  to  g  from 
the  cognitive  domain.  Thus,  the  selection  of  an  appropriate  level  of 
psychomotor  ability  complexity  for  the  current  research  is  really  a  prag¬ 
matic  question:  Which  level  of  complexity  is  likely  to  result  in  measures 
with  the  highest  predictive  validity? 

Second,  assessment  of  Information  processing  abilities  has  been 
plagued  with  psychometric  problems.  Allen,  Rose,  and  Kramer  (1978)  have 
conducted  research  on  the  stability  of  Individual  differences  in  informa¬ 
tion  processing  abilities.  Results  of  their  research  are  summarized  In 
Table  3.  The- reliabilities  of  these  Information  processing  measures  tend 
to  be  rather  low.  Test-retest  reliabilities  (one-day  interval  between  test 
sessions)  ranged  from  .12  for  Cluster  Level  on  the  Sentence  Recall  Task 
to  .85  for  both  Inclusion  Errors  and  Exclusion  Errors  on  the  Physical  Match 
Task,  with  a  median  reliability  of  only  .50.  Other  researchers  have  found 
that  the  stability  of  many  Information  processing  measures  Increases  with 
practice  (Sternberg,  1981).  That  Is,  the  correlation  between  subjects’ 
scores  on  Trials  1  and  2  of  an  Information  processing  test  would  typically 
be  quite  a  bit  lower  than  the  correlation  between  Trials  29  and  30.  This 
lack  of  Initial  stability  has  led  many  Information  processing  researchers 
to  conclude  that  each  subject  should  be  allowed  to  practice  an  Information 
processing  task  for  several  hours  before  assessment  of  the  subject  Is  at¬ 
tempted.  In  contrast,  the  reliabilities  reported  for  the  more  complex 
psychomotor  tests  In  Appendix  A,  which  were  obtained  after  allowing  the 
subject  little  or  no  practice,  are  quite  a  bit  higher.  Almost  all  of  the 
reliabilities  are  .70  or  greater.  Since  the  criterion-related  validity  of 
a  test  Is  limited  by  Its  reliability,  the  complex  psychomotor  tests  are 
more  likely  to  yield  significant  predictive  validity  coefficients  If  time 
constraints  make  It  Impractical  to  allow  subjects  a  great  deal  of  practice 
time  prior  to  assessment. 

Finally,  and  perhaps  most  Importantly,  in  a  validity  study  the  appro¬ 
priate  level  of  ability  complexity  should  be  determined  primarily  by  the 
complexity  of  the  criterion  one  Is  attempting  to  predict  (Hogan  &  Eagan, 
1983).  Listed  below  are  some  of  the  criteria  from  the  current  research 
which  have  been  judged  by  experts  to  be  dependent  on  psychomotor  profi¬ 
ciency: 

o  Repair  Mechanical  Systems.  Perform  corrective  actions  on 
previously  diagnosed  malfunctions  of  mechanical  equipment  or 
mechanical  components  using  appropriate  tools  (e.g., 
wrenches,  screwdrivers,  gauges,  hammers)  in  conjunction  with 
technical  information. 

o  Construct  Wooden  Buildings  and  Other  Structures.  Perform 
carpentry  activities  (e.g.,  measure,  saw,  nail,  plane)  to 
frame,  sheath  and  roof  buildings,  or  to  erect  trestles, 
bridges,  piers,  and  so  forth. 

o  Load  Field  Artillery  or  Tank  Guns.  Manipulate  breech  con¬ 
trols  and  handle  ammunition  (stow  and  load)  to  prepare  guns 
for  firing. 
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Table  3 


One-Dav  Test-Retest  Reliability  of  Information 
Processing  Abilities 


Information  Processing  Measure _ Reliability 

Physical  Match  Task 


Inclusion  Errors 

.85 

Exclusion  Errors 

.85 

Letter  Recall  Task 

Order  Recall 

Slope 

.21 

Intercept 

.51 

Derived  Free  Recall 

• 

Slope 

.19 

Intercept 

.49 

Series  Recall 

Slope 

.24 

Intercept 

.41 

Mental  Addition  Task 

Blanks 

Slope 

.61 

Intercept 

.39 

Correct 

Slope 

.35 

Intercept 

.14 

Sentence  Recall  Task 

Mean  Cluster 

.37 

Maximum  Cluster 

.41 

Cluster  Level 

.12 

Sentence  Recognition  Task 

Errors 

.18 

New-Consistent 

.14 

New-Inconsistent 

.27 

Old-Consistent 

.30 

Old-Inconsistent 

.31 

(Continued) 
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Table  3  (Continued) 


One-Dav  Test-Retest  Reliability  of  Information 

Processing .Abilities 


Information  Processlna  Measure 

Reliability 

Letter  Rotation  Task 

Inclusion  Errors 

Slope 

.53 

Intercept 

.72 

Exclusion  Errors 

Slope 

.75 

Intercept 

.64 

Set  Membership  Task 

Inclusion  Errors 

Slope 

.81 

Intercept 

.66 

Exclusion  Errors 

Slope 

.79 

Intercept 

.66 

Scan  and  Search  Task 

Inclusion  Errors 

Clear 

Slope 

.76 

Intercept 

.77 

Degraded 

Slope 

.71 

Intercept 

.70 

Exclusion  Errors 

Clear 

Slope 

.68 

Intercept 

.63 

Degraded 

Slope 

.60 

Intercept 

.55 

Note.  The  data  are  from  An  information  processing 
approach  to  performance  assessment:  III.  An  elaboration 
and  refinement  of  an  information  processing  battery 
(AIR-58500-TR)  (pp.  56-57)  by  T.  W.  Allen,  A.  M.  Rose, 
and  L.  J.  Kramer,  1978,  Washington,  DC:  American 
Institutes  for  Research.  fi*54 . 
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o  Operate  Track  Vehicles.  Use  various  vehicle  controls  to 
drive  track  vehicles  (e.g.,  tanks,  APCs,  scout  vehicles, 
bulldozers).  Steer  In  response  to  terrain  features. 


o  Provide  Medical  and  Dental  Treatment.  Give  medical  attention 
to  soldiers  In  the  field,  or  medical  or  dental  clinic,  or  to 
animals  (e.g.,  CPR,  splinting  fractures,  administering  Injec¬ 
tions,  dressing  wounds). 

The  description  of  these  criteria  suggests  that  performance  on  these 
tasks  Is  determined  In  large  measure  by  relatively  complex  psychomotor 
abilities.  For  example,  the  definition  of  control  precision  (ability  to 
make  fine  muscular  movements  necessary  to  position  equipment  control 
mechanisms  In  response  to  a  stimulus  whose  speed  and/or  direction  of  move¬ 
ment  are  perfectly  predictable)  bears  great  similarity  to  the  description 
of  some  of  the  activities  comprising  the  criterion  Repair  Mechanical  Sys¬ 
tems.  Indeed,  the  history  of  many  of  the  apparatus  tests  used  by  Fleishman 
(Melton,  1947)  illustrates  that  these  tests  were  designed  specifically  to 
tap  abilities  relevant  to  job  performance  In  the  military.  Validity  evi¬ 
dence,  which  will  be  presented  In  a  subsequent  section,  Indicates  that 
these  complex  psychomotor  tests  have  been  quite  effective  in  serving  that 
function.  Thus  the  complexity  of  Fleishman’s  ability  factors  appears 
perfectly  appropriate  given  the  objectives  of  the  current  research. 

Construct  Explication  of  the  Psvchomotor  Domain 

Anastasl  (1983)  has  noted  that  the  construct  explication  of  a  trait  or 
ability  requires  Input  from  several  sources  In  addition  to  factor  analysis. 
These  sources  might  Include  field  studies,  naturalistic  observation,  task 
analyses,  validity  studies,  controlled  experiments,  and  psychophyslological 
or  behavior  genetic  research. 

Unfortunately,  to  date,  there  has  been  little  effort  expended  on 
explication  of  the  abilities  comprising  Fleishman’s  psychomotor  taxonomy. 
Indeed,  there  have  been  few  attempts  to  review  and  summarize  our  knowledge 
concerning  any  psychomotor  ability  or  taxonomy. 

In  large  measure,  this  lack  of  knowledge  can  be  traced  to  decisions  by 
the  military  In  the  mid  and  late  1950s  to  abandon  psychomotor  testing  in 
the  selection  of  pilots  and  flight  crews  (North  &  Griffin,  1977;  Passey  & 
Mclaurln,  1966).  Ironically,  these  decisions  were  made  during  the  period 
that  Fleishman  was  conducting  his  research  Into  the  taxonomy  of  psychomotor 
abilities. 

Research  conducted  during  World  War  II  had  proven  the  validity  of 
psychomotor  tests  for  predicting  job  and  training  performance  for  several 
military  occupational  specialties.  There  were,  however,  substantial  prob¬ 
lems  with  psychomotor  test  administration,  especially  for  the  apparatus 
tests.  For  example,  most  of  the  apparatus  tests  required  individual  admin¬ 
istration,  making  them  quite  expensive  to  use  and  administer.  Perhaps  even 
more  troublesome  was  the  notorious  unreliability  of  the  various  apparatus 
used  In  the  administration  of  the  tests.  The  Navy,  for  example,  had  de¬ 
cided  during  World  War  II  that  It  would  not  use  any  selection  tests  which 
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could  not  be  administered  easily  In  decentralized  testing  sites  (Vlteles, 
1942).  Since  subjects’  test  scores  often  varied  significantly  from  one 
rudder  control  apparatus  to  the  next,  from  one  complex  coordination  appa¬ 
ratus  to  the  next,  etc..  It  was  Impossible  for  the  Navy  to  ensure  standar¬ 
dization  of  psychomotor  apparatus  tests  among  Its  test  sites.  Therefore, 
the  Navy  ceased  to  use  apparatus  tests.  The  Air  Force  followed  suit  10 
years  later. 

Initially,  some  psychologists  believed  that  new,  more  reliable  psycho¬ 
motor  tests  could  be  developed  to  take  the  place  of  the  original  ACB 
apparatus  tests.  Adams  (1956)  attempted  to  develop  a  series  of  very  simple 
motor  tasks  which  he  hoped  would  tap  the  same  abilities  as  the  ACS  tests. 
Others  worked  on  paper-and-pencll  tests  of  psychomotor  abilities.  Virtu¬ 
ally  all  of  the  validity  results  were  disappointing.  In  a  summary  of  these 
efforts,  Cronbach  (1970)  concluded  that  It  was  unlikely  that  either  oaper- 
and-pencll  tests  or  simple  motor  tests  could  measure  the  complex  dexterity 
and  coordination  abilities  that  the  apparatus  tests  measured. 

Passey  and  Mclaurln  (1966)  were  among  the  first  to  advocate  renewed 
use  of  psychomotor  apparatus  tests  for  pilot  selection.  They  reviewed  both 
the  perceptual -psychomotor  ability  literature  and  the  literature  on  the  be¬ 
havioral  functions  of  a  pilot.  The  goal  of  their  review  was  to  develop  a 
list  of  ability  constructs  to  guide  future  test  development  efforts  for 
pilot  selection.  Their  final  list  consisted  of  19  ability  constructs, 
Including  one  called  psychomotor  ability.  They  felt  that  the  most  relevant 
psychomotor  abilities  for  pilot  performance  would  be  those  Involving  fine, 
highly  controlled  adjustment  (l.e.,  control  precision)  and  multilimb  co¬ 
ordination.  They  also  expressed  Interest  In  a  construct  they  called  an¬ 
ticipatory  behavior.  This  construct  was  quite  similar  to  Fleishman’s  rate 
control  ability  factor  (Fleishman,  1958;  Fleishman  &  Hempel,  1956).  Passey 
and  McLaurln  recommended  that  future  aircrew  selection  tests  consist  of 
"complex  behavioral  tasks  Involving  Intellectual,  sensory-motor,  and  per¬ 
ceptual  components"  (p.  94). 

Imhoff  and  Levine  (1981)  have  also  reviewed  the  perceptual,  psycho¬ 
motor,  and  cognitive  ability  literature  as  part  of  an  effort  to  develop  a 
list  of  ability  constructs  for  Inclusion  In  a  pilot  selection  battery. 
Their  review  represents  one  of  the  most  thorough  efforts  to  explicate  the 
ability  constructs  comprising  the  psychomotor  domain. 

Imhoff  and  Levine  focused  their  summary  of  the  psychomotor  ability 
literature  on  two  major  bodies  of  research:  the  individual  differences 
research  conducted  by  Fleishman  and  his  associates  and  research  on  the  role 
of  feedback  In  motor  skills  learning  conducted  by  a  number  of  motor  skills 
learning  researchers  and  theorists.  Based  on  their  readings,  Imhoff  and 
Levine  concluded  that  most  of  Fleishman’s  psychomotor  ability  factors  could 
be  collapsed  Into  two  major  dimensions. 

The  first  dimension,  basic  movement  speed  and  accuracy,  subsumed 
Fleishman’s  control  precision,  speed  of  arm  movement,  and  reaction  time 
abilities.  According  to  Imhoff  and  Levine,  the  movements  controlled  by 
these  abilities  tend  to  be  repetitive  and  patterned.  Once  these  movements 
are  Initiated  there  Is  little  need  for  feedback  or  situational  cues,  since 
the  movements  require  virtually  no  ongoing  regulation  (Laszlo,  1967;  Laszlo 
&  Manning,  1970).  Little  cognitive  processing  is  required,  and  the 
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movements  are  typically  quite  short  In  duration.  Keele  (1968)  concluded 
that  these  movements  are  probably  under  the  control  of  motor  programs. 

This  led  Imhoff  and  Levine  to  conclude  that  control  precision,  speed  of  arm 
movement,  and  reaction  time  were  probably  Instrumental  In  both  the  develop¬ 
ment  and  the  execution  of  these  programs. 

Imhoff  and  Levine  called  their  second  major  psychomotor  ability  dimen¬ 
sion  perceptual -motor  movement  control.  Included  were  multlllmb  coordina¬ 
tion,  response  orientation,  and  rate  control.  According  to  Imhoff  and 
Levine,  these  three  abilities  are  Important  for  "movement  [which  Is]  guided 
by  sensory  and  perceptual  feedback  from  Initial  responses"  (p.  18).  Move¬ 
ments  controlled  by  this  ability  dimension  tend  to  take  longer  to  execute 
than  the  movements  controlled  by  the  basic  movement  speed  and  accuracy 
dimension.  Moreover,  each  "movement"  under  perceptual -motor  movement  con¬ 
trol  actually  consists  of  a  number  of  short  movements.  The  outcomes  asso¬ 
ciated  with  each  of  these  short  movements  may  have  Important  Implications 
for  the  nature  and  timing  of  subsequent  movements.  Thus,  cognitive  proces¬ 
sing  of  proprioceptive  and  environmental  stimuli  plays  a  key  role  In  per¬ 
ceptual-motor  movement  control.  For  example,  Fleishman  and  Rich  (1963) 
found  that  subjects  classified  as  high  In  kinesthetic  sensitivity  performed 
Increasingly  better  over  10  trials  on  a  two-hand  coordination  task  relative 
to  a  group  classified  low  on  kinesthetic  sensitivity.  Other  modes  of 
feedback  which  have  been  shown  to  affect  performance  on  tasks  controlled  by 
perceptual -motor  movement  abilities  include  visual  (Lackner,  1974),  audi¬ 
tory  (Karlovich  &  Graham,  1968),  and  spatial  orientation  (Weltzman,  1979). 

Imhoff  and  Levine’s  taxonomy  represents  an  Important  contribution  to 
the  psychomotor  ability  literature.  Their  attempt  to  Integrate  motor 
skills  learning  theories  and  research  with  Fleishman’s  taxonomy  represents 
one  of  the  few  attempts  to  explicate  the  psychomotor  ability  domain.  More 
will  be  said  about  the  relationship  between  these  two  Important  bodies  of 
research  In  the  next  section. 

While  Imhoff  and  Levine’s  taxonomy  provides  useful  information  about 
the  relationship  among  Fleishman’s  psychomotor  ability  factors,  their  abil¬ 
ity  dimensions  may  be  too  broad  to  be  useful  In  describing  jobs  or  in  dis¬ 
tinguishing  between  jobs  or  criterion  constructs.  As  noted  previously,  the 
level  of  specificity  in  Fleishman’s  abilities  corresponds  closely  to  the 
level  of  specificity  of  the  criterion  constructs  being  used  in  the  current 
research.  Moreover,  since  Imhoff  and  Levine  were  concerned  only  with  the 
psychomotor  abilities  relevant  to  pilot  performance,  their  taxonomy  failed 
to  consider  several  Important  psychomotor  abilities,  Including  finger  dex¬ 
terity,  manual  deterity,  wrist-finger  speed,  aiming,  and  arm-hand  steadi¬ 
ness.  The  use  of  these  abilities  as  predictors  of  performance  In  the  Army 
deserves  consideration. 

A  Final  Word  on  the  Taxonomy  of  the  Psvchomotor  Domain 

Thus,  In  spite  of  the  many  criticisms  which  have  been  leveled  against 
it  and  the  alternatives  which  have  been  suggested  to  replace  it,  the  psycho¬ 
motor  ability  taxonomy  Identified  by  Fleishman  (1975)  would  seem  to  be  the 
one  most  relevant  to  the  current  Army  research  project.  The  level  of 
ability  specificity  seems  to  correspond  quite  closely  to  the  complexity  of 
the  major  criterion  tasks  Identified  for  this  project.  Moreover,  the  tests 
used  in  the  Identification  of  the  taxonomy  have  been  used  successfully  to 
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predict  military  job  and  training  performance  In  the  past  (Melton,  1947), 
suggesting  that  the  abilities  underlying  performance  on  these  tests  are 
Indeed  related  to  the  criteria  we  are  attempting  to  predict. 

The  one  place  where  this  taxonomy  departs  from  Fleishman’s  (1975),  as 
noted  In  the  introduction  to  this  section,  Is  that  reaction  time  and  re¬ 
sponse  orientation  have  been  Included  in  the  cognitive-perceptual  domain 
Instead  of  In  the  psychomotor  domain.  Our  research  team  will,  of  course, 
continue  to  evaluate  the  utility  of  these  ability  constructs  as  predictors 
for  the  current  research.  Thus,  this  departure  from  Fleishman’s  taxonomy 
should  not  be  viewed  as  a  rejection  of  the  relevance  and  utility  of  these 
abilities.  Rather,  It  should  be  regarded  as  a  simple  disagreement  con¬ 
cerning  the  domain  in  which  these  abilities  should  be  classified  and  dis¬ 
cussed. 
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SECTION  III 
MOTOR  SKILLS  LEARNING 

While  the  focus  of  this  report  is  on  individual  differences  in  psycho¬ 
motor  abilities,  the  vast  majority  of  psychological  research  on  motor 
performance  has  been  conducted  by  experimental  psychologists  Interested  in 
general  laws  and  rules  of  motor  learning.  While  very  little  of  the  re¬ 
search  on  motor  skills  learning  has  been  directly  concerned  with  the  as¬ 
sessment  or  correlates  of  Individual  differences  in  psychomotor  abilities, 
the  motor  skills  learning  literature  still  provides  a  great  deal  of  Infor¬ 
mation  which  Is  useful  in  understanding  psychomotor  abilities.  For  ex¬ 
ample,  the  various  theories  of  motor  skills  learning  which  have  been  pro¬ 
posed  have  provided  an  Improved  understanding  of  the  relationship  between 
psychomotor  abilities  and  cognitive,  perceptual,  and  motor  processes.  Some 
of  the  research  inspired  by  these  models  was  cited  In  the  preceding  section 
In  an  effort  to  define  and  clarify  the  taxonomy  of  the  psychomotor  ability 
domain. 


A  Definition  of  Motor  Skill s  Learning 

Motor  skills  learning  Is  "the  acquisition  of  a  temporal -spatial  organ¬ 
ization  of  muscular  movement  In  a  precise  and  consistent  manner"  (Hall, 
1982,  p.  248).  The  "temporal  organization  of  muscular  movement”  refers  to 
the  fact  that  most  motor  movements  require  precisely  timed  muscular  con¬ 
tractions  and/or  relaxations,  while  the  "spatial  organization  of  muscular 
movement”  refers  to  the  need  to  contract  and  relax  the  appropriate  muscles 
to  the  precise  degree  required  to  achieve  the  outcome  desired  from  the 
motor  movement. 

While  motor  skills  learning  is  typically  Inferred  from  motor  perfor¬ 
mance,  learning  and  performance  are  not  synonymous --not  In  motor  skills  nor 
In  any  other  learning  domain.  That  Is,  what  a  subject  Is  observed  to  do  is 
not  always  Indicative  of  what  a  subject  is  capable  of  doing.  Adams  (1954), 
for  example,  required  subjects  to  align  a  panel  of  lights  via  manipulation 
of  a  hand  and  foot  control.  The  task  was  timed,  with  some  subjects  per¬ 
mitted  more  time  to  complete  the  alignment  than  others.  Adams  found  that 
the  subjects  who  were  given  the  most  time  to  complete  the  alignment  made 
the  most  correct  alignments  per  2-minute  trial,  while  the  subjects  who  were 
given  the  least  amount  of  time  made  the  fewest  correct  alignments  per 
trial.  Subsequently,  Adams  removed  all  time  limits  and  allowed  subjects  to 
complete  the  alignments  at  their  own  pace.  He  found  no  differences  between 
groups  In  the  number  of  alignments  completed  per  trial,  suggesting  that  the 
amount  of  learning  In  all  groups  was  equal. 

Typical  Motor  Performance  Learning  Curves 

Figure  1  shows  typical  learning  curves  for  four  basic  motor  skills 
learning  research  paradigms.  Figure  1A  shows  that,  as  subjects  practice  a 
task,  they  make  fewer  and  fewer  errors  until  performance  reaches  an  a- 
symptote  near  zero  errors.  Thus,  performance  becomes  more  accurate  v»'th 
practice.  Figure  IB  shows  that  the  number  of  correct  responses  per  trial 
Increases  with  practice  as  performance  becomes  faster  and/or  more  accurate 
(i.e.,  less  time  Is  expended  on  Inaccurate  responses).  Research  by 
Crossman  (1959)  Indicates  that  number  of  correct  responses  per  trial  may 
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Typical  learning  curves  for  four  basic  motor  skills  learni 
research  paradigms. 


not  reach  an  asymptotic  level  for  years.  Crossman  found  that  workers  In  a 
cigar  factory  were  still  Increasing  their  rate  of  cigar  production  after 
three  years--or  two  million  c1gars--on  the  job.  Figure  1C  shows  that 
subjects  respond  more  quickly  on  a  motor  task  with  practice,  until  response 
time  approaches  zero.  For  example,  Seibel  (1964)  found  that  his  subjects’ 
simple  reaction  time  was  still  decreasing  and  showing  no  signs  of  reaching 
asymptote,  even  after  75,000  trials  on  a  simple  reaction  time  task.  Fig¬ 
ure  10  applies  primarily  to  tracking  tasks.  A  common  dependent  variable  In 
these  tasks  Is  time  on  target.  The  figure  Illustrates  that  time  on  target 
Increases  (l.e.,  performance  becomes  more  accurate)  with  practice.  Indeed, 
all  four  figures  Illustrate  how  performance  Improves,  but  Improves  at  a 
decreasing  rate,  as  practice  progresses  until  subjects’  performance  reaches 
a  maximal,  asymptotic  level. 

The  Course  of  Motor  Skills  Learning 

Stages  of  Motor  Skills  Acquisition.  Bryan  and  Harter  (1897)  studied  a 
group  of  subjects  who  were  learning  to  use  Morse  Code.  The  researchers 
noted  that  subjects’  performance  Improved  consistently  throughout  many 
trials  until  their  level  of  performance  reached  an  apparent  asymptote. 
Subsequently,  following  several  trials  where  performance  Improved  very 
little,  performance  again  began  to  Improve  steadily.  Bryan  and  Harter 
dubbed  this  the  plateau  phenomenon.  They  hypothesized  that  during  the 
performance  plateau,  subjects  were  altering  their  strategies  for  performing 
the  task.  While  subjects’  Initial  strategies  were  efficient  during  the 
first  trials  of  task  learning,  Bryan  and  Harter  theorized,  the  strategies 
had  to  be  altered  In  order  for  performance  to  become  more  automated  and 
efficient. 

Since  Bryan  and  Harter’s  Initial  discovery,  the  plateau  phenomenon  has 
been  replicated  by  a  number  of  researchers.  Some  of  these  researchers  have 
suggested  that  the  phenomenon  Is  artifactual.  According  to  these  re¬ 
searchers,  the  phenomenon  appears  to  occur  only  because  of  the  Insensiti¬ 
vity  of  our  measuring  devices;  learning  actually  occurs  evenly  throughout 
all  skill  acquisition  trials.  Most  researchers,  however,  agree  with  Bryan 
and  Harter’s  view  that  motor  skills  learning  occurs  In  stages. 

The  most  widely  cited  stage  model  of  motor  skills  acquisition  Is  Fitts 
and  Posner’s  (1967).  According  to  Fitts  and  Posner,  learning  occurs  In 
three  stages. 

During  the  first  stage,  the  subjects  attempt  to  understand  the  nuances 
of  the  task  and  the  desired  outcome.  Much  of  the  subjects’  efforts  are 
invested  In  thinking  and  processing  information.  They  must  determine  which 
situational  cues  are  relevant  to  task  execution.  They  must  learn  the  motor 
steps  Involved  in  task  performance.  Often,  they  will  say  these  steps 
aloud,  as  If  for  guidance  as  they  perform  the  task.  Because  this  phase  of 
the  task  Is  distinguished  by  a  considerable  degree  of  cognitive  activity, 
Fitts  and  Posner  called  it  the  cognitive  stage. 

During  the  second  stage  of  task  learning,  the  subjects’  activities 
shift  from  "what  to  do"  to  "how  to  do  It."  The  subjects  attempt  a  number 
of  different  responses  in  an  effort  to  find  the  most  efficient  means  of 
achieving  the  desired  outcome.  Erroneous  responses  are  abandoned,  more 
successful  response  elements  are  combined  with  successful  elements  of  other 


response  efforts,  and  the  subject  begins  to  develop  an  effective,  in¬ 
tegrated  motor  skill.  This  emphasis  on  experimentation  and  Integration 
prompted  Fitts  and  Posner  to  call  this  the  associative  stage  of  motor 
skills  acquisition.  * 

Entering  the  final  phase  of  motor  skills  acquisition,  the  subjects  are 
already  quite  proficient  at  the  task.  During  this  phase,  performance  will 
only  Improve  somewhat.  Nevertheless,  Important  changes  occur  in  the  manner 
In  which  the  subjects  accomplish  the  task.  The  subjects  learn  to  perform 
the  task  proficiently  without  paying  much  attention  to  what  they  are  doing. 
This  frees  them  to  concentrate  on  any  other  tasks  at  hand.  In  addition, 
the  subjects  begin  to  use  Internal  kinesthetic  and  proprioceptive  cues  as 
feedback  to  tell  them  whether  they  are  performing  the  task  correctly.  The 
former  reliance  on  external  sources  of  feedback  and  Information  is  dimin¬ 
ished.  To  reflect  the  lessened  attentlonal  demands  of  the  task  during  this 
stage,  Fitts  and  Posner  named  this  the  automatic  or  autonomous  stage  of 
motor  skills  acquisition. 

The  Role  of  Abilities  In  Skills  Acquisition.  Stage  models  of  motor 
skills  acquisition  are  particularly  interesting  to  differential  psycholo¬ 
gists  because  they  suggest  that  the  abilities  contributing  to  skilled 
performance  may  vary  depending  on  the  skill  acquisition  stage.  Fleishman 
and  his  colleagues  have  been  among  the  most  active  researchers  In  pursuing 
this  area  of  overlap  between  experimental  and  correlational  psychology. 

At  the  heart  of  Fleishman’s  research  Is  the  distinction  he  and  his  as¬ 
sociates  draw  between  abilities  and  skills.  According  to  Parker  and 
Fleishman  (1960),  "The  term  ability  refers  to  a  more  general,  stable  trait 
of  an  Individual  which  may  facilitate  performance  In  a  variety  of  different 
tasks"  (p.  1).  For  example,  manual  dexterity  may  be  related  to  tasks  as 
diverse  as  building  a  birdhouse  and  filling  a  cavity  in  a  tooth.  The  term 
skill,  on  the  other  hand,  refers  to  an  Individual’s  level  of  proficiency  on 
a  particular  task.  One  might  speak,  for  example,  of  an  Individual  who  is 
skilled  at  building  birdhouses.  This  skill  may  be  dependent  upon  a  number 
of  different  abilities  (e.g.,  manual  dexterity,  finger  dexterity,  numerical 
facility). 

All  theories  of  motor  skills  learning  recognize  that  Individuals 
become  proficient  at  complex  tasks  via  practice.  Fleishman  and  other 
differential  psychologists  note,  however,  that  even  with  extended  practice 
Individuals  will  differ  greatly  in  their  proficiency  on  a  given  task. 
Moreover,  Individuals  will  also  differ  in  their  proficiency  on  a  task 
during  their  very  first  effort  at  performing  that  task.  In  an  effort  to 
understand  this  phenomenon  more  clearly,  Fleishman  and  his  colleagues  have 
Investigated  a  number  of  features  of  the  skills  acquisition  process.  The 
research  questions  they  have  addressed  Include: 

1.  What  abilities  are  related  to  performance  on  complex  tasks 
during  the  Initial  phases  of  task  practice  and  learning? 

2.  What  abilities  are  related  to  performance  on  complex  tasks 
during  the  final  phases  of  task  practice  and  learning  (i.e., 
when  Individuals  have  reached  an  asymptotic  level  of  task 
proficiency)? 
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3.  How  stable  are  Individual  differences  In  task  proficiency 
from  the  Initial  to  the  final  phase  of  task  practice  and 
learning? 

4.  Do  the  abilities  contributing  to  Individual  differences  in 
task  proficiency  during  the  initial  phase  of  task  practice 
and  learning  differ  from  those  contributing  to  Individual 
differences  during  the  final  phase  of  task  practice  and 
learning? 

5.  In  comparison  with  other  parameters  governing  skilled  task 
performance  (e.g.,  massed  vs.  distributed  practice,  amount 
of  time  elapsed  since  the  task  was  last  practiced),  how 
Important  are  Individual  ability  differences  In  determining 
an  Individual's  level  of  task  proficiency? 

To  answer  these  questions,  Fleishman  and  his  associates  conducted 
several  studies  (e.g.,  Fleishman,  1960;  Fleishman  &  Fruchter,  1960; 
Fleishman  &  Hempel,  1954b,  1955;  Parker  &  Fleishman,  1960).  In  a  typical 
study,  these  researchers  administered  a  battery  of  psychomotor  and 
cognitive-perceptual  reference  tests  to  a  sample  of  200-300  subjects. 
Subsequently,  subjects  received  extended  practice  on  a  complex  criterion 
task.  Factor  analysis  was  used  to  identify  the  abilities  represented  In 
the  reference  test  battery.  The  factor  loadings  of  the  various  criterion 
task  practice  trials  on  these  ability  factors  represented  the  correlations 
between  the  abilities  and  the  criterion  task  at  various  stages  of  practice. 

Fleishman  (1967,  1972,  1975)  has  reported  four  general  conclusions 
based  on  this  research: 

1.  As  practice  continues,  the  particular  combination  of  abili¬ 
ties  contributing  to  task  performance  changes. 

2.  These  ability  changes  are  progressive  and  systematic  from 
trial  to  trial.  Eventually  the  changes  cease  and  the  rela¬ 
tive  contributions  of  the  various  abilities  to  task  perfor¬ 
mance  stabllze. 

3.  Psychomotor  abilities  become  relatively  more  important  de¬ 
terminants  of  task  performance  (vis-a-vis  cognitive- 
perceptual  abilities  such  as  spatial  or  verbal  ability)  as 
practice  continues. 

4.  As  practice  continues,  an  ever-increasing  percentage  of  task 
performance  variance  Is  specific  to  the  task  itself  ( 1 . e . , 

It  Is  unrelated  to  any  of  the  broad,  general  abilities 
represented  In  the  reference  test  battery). 

These  conclusions  have  been  bolstered  by  other  studies  conducted  by 
Fleishman  and  his  colleagues  using  different  research  designs.  In  one  of 
these  studies,  Fleishman  (1957)  performed  two  separate  factor  analyses  for 
a  test  battery.  For  one  factor  analysis,  the  test  battery  data  included 
scores  from  a  complex  task  taken  from  the  Initial  stages  of  practice  on 
that  task.  For  the  second  factor  analysis,  the  test  battery  included 
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scores  for  the  same  task  from  a  later  stage  of  practice.  In  two  other 
studies,  Fleishman  (1965)  and  Fleishman  and  Fruchter  (1965)  examined  the 
relationship  between  total  task  performance  and  performance  on  task  com¬ 
ponents  at  various  stages  of  practice.  Data  from  all  of  these  studies  were 
consistent  with  the  four  basic  conclusions  noted  above. 

Other  researchers  have  also  Investigated  the  relationships  between 
psychomotor  skills  and  abilities.  Kohfeld  (1966)  found  that  correlations 
between  performance  on  a  complex  psychomotor  task  and  a  verbal  test  de¬ 
creased  consistently  over  15  practice  trials  on  the  task,  while  correla¬ 
tions  between  the  task  and  a  psychomotor  test  Increased  with  task  practice. 
Adams  (1953,  1957)  and  Hinrichs  (1970)  also  found  that  the  ability  corre¬ 
lates  of  a  complex  psychomotor  task  changed  as  the  task  was  practiced. 
Unlike  Fleishman,  however,  Hlnrlchs  found  that  task  performance  was  equally 
predictable  from  the  ability  tests  during  early  and  late  practice  trials. 
According  to  Hlnrlchs,  there  was  no  evidence  that  task-specific  variance 
Increased  as  the  task  was  practiced.  Hlnrlchs  attributed  this  finding  to 
the  fact  that  he  Included  ability  tests  which  closely  resembled  the  complex 
task  In  his  test  battery.  Hlnrlchs  found  that  the  correlations  between 
these  tests  and  the  task  actually  Increased  as  the  task  was  practiced. 

This  finding  Is  certainly  not  Inconsistent  with  Fleishman’s  finding  that 
variance  specific  to  a  task  Increases  with  practice. 

Models  of  Skill-Ability  Relationships.  Three  different  models  have 
been  developed  In  an  effort  to  explain  Fleishman’s  four  major  research 
findings  summarized  previously.  These  models  were  described  and  evaluated 
In  a  1971  technical  report  by  Hulln  and  Alvares. 

The  changing  task  model,  which  Fleishman  (1966)  espoused,  suggests 
that  the  abilities  required  for  proficient  task  performance  change  with 
practice.  An  individual’s  level  of  task  proficiency  on  a  given  practice 
trial  thus  depends  on  both  the  abilities  possessed  by  the  Individual  and 
the  degree  to  which  each  of  these  abilities  contribute  to  task  performance 
at  that  stage  of  practice. 

Adams  (1957),  Humphreys  (1960),  and  others  have  offered  a  competing 
model,  the  changing  subject  model.  While  there  are  important  differences 
In  the  changing  subject  models  offered  by  the  various  advocates  of  this 
position,  all  of  these  models  suggest  that  practice  on  a  complex  task 
actually  changes  an  Individual’s  ability  level.  It  is  therefore  no  sur¬ 
prise  that  correlations  between  the  task  and  ability  test  scores  taken 
before  the  task  was  first  practiced  should  decrease  as  practice  continues; 
the  practice  Is  actually  changing  the  Individual’s  true  ability  level.  One 
of  the  most  Important  differences  between  the  changing  task  and  changing 
subject  models  Is  that  the  changing  subject  model  makes  no  distinction 
between  skills  and  abilities.  According  to  the  changing  subject  model, 
changes  In  task  proficiency  are  changes  In  ability. 

The  third  model  represents  a  combination  of  the  first  two  models 
(Hulin  &  Alvares,  1971).  Like  the  changing  subjects  model,  this  third 
model  makes  no  distinction  between  skills  and  abilities.  Thus,  changes  in 
task  proficiency  are  attributed  at  least  In  part  to  an  individual’s  im¬ 
proving  abilities.  The  third  model  also  postulates,  however,  that  the 
abilities  involved  in  successful  task  performance  change  as  practice  on  the 
task  continues. 
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According  to  Hulin  and  Alvares,  data  on  psychomotor  abilities  and 
psychomotor  task  proficiency  provide  no  clear  Indication  of  which  model  Is 
most  accurate.  None  of  the  three  models  can  be  rejected  based  on  currently 
available  data. 

The  validity  of  these  three  models  Is  Important  because  the  models 
have  differing  implications  for  training.  The  changing  task  model,  for 
example,  suggests  that  the  most  appropriate  training  strategy  Is  to  focus 
training  on  the  abilities  Involved  In  task  performance  during  each  stage  of 
practice.  In  a  test  of  this  hypothesis,  Parker  and  Fleishman  (1961)  used 
two  different  training  strategies  In  teaching  subjects  to  perform  a  complex 
tracking  task.  One  of  the  strategies  was  a  "common  sense"  strategy  de¬ 
signed  to  resemble  typical  military  training  and  Instruction.  Subjects 
assigned  to  this  condition  received  a  verbal  description  and  demonstration 
of  the  task.  They  were  then  allowed  to  practice  the  task  Individually. 
During  this  practice,  they  received  guidance  and  assistance  from  Instruc¬ 
tors.  The  training  curriculum  for  the  "experimental"  group  was  designed  to 
provide  subjects  with  an  opportunity  to  Improve  abilities  which  were  known 
to  be  related  to  task  performance  at  various  stages  of  practice.  For 
example,  previous  analysis  of  the  task  revealed  that  spatial  orientation 
was  most  highly  related  to  task  performance  during  the  fourth  tracking 
session.  Therefore,  subjects  were  provided  with  special  Instructions  de¬ 
signed  to  acquaint  them  with  the  spatial  orientation  requirements  of  the 
task  prior  to  and  during  the  third  tracking  session.  An  analysis  of  track¬ 
ing  error  scores  revealed  that  the  experimental  group  learned  the  tracking 
task  more  quickly  and  attained  a  higher  level  of  tracking  proficiency. 

These  differences  In  tracking  performance  persisted  even  after  the  two 
groups  began  to  receive  Identical  training  beginning  with  the  11th  tracking 
session. 

Additional  Research  on  the  Role  of  Individual  Differences  In  Learning. 
Fleishman  and  his  colleagues  have  also  Investigated  the  relationship  be¬ 
tween  Individual  difference  variables  and  several  other  learning  and  train¬ 
ing  phenomena. 

Fleishman  and  Rich  (1963)  showed  that,  as  practice  continues,  correla¬ 
tions  between  performance  on  a  two-hand  coordination  task  and  spatial - 
visual  abilities  decreased  while  correlations  between  task  performance  and 
kinesthetic  abilities  Increased.  These  findings  suggest  that  the  spatial 
cues  which  subjects  use  to  learn  how  to  perform  a  task  diminish  in  Impor¬ 
tance  as  the  task  is  learned.  Instead,  subjects  begin  to  rely  more  on 
proprioceptive  feedback  (l.e.,  knowledge  of  the  "right  feel"  or  the  "right 
touch")  as  they  become  more  proficient. 

In  a  study  on  skill  retention,  Fleishman  and  Parker  (1962)  examined 
the  reliability  of  Individual  differences  in  task  proficiency.  Initially, 
Fleishman  and  Parker  administered  a  battery  of  ability  tests  to  their 
subjects.  Subjects  then  practiced  a  complex  psychomotor  task  extensively 
for  seven  weeks  before  a  period  of  no  practice  was  instituted.  This  no¬ 
practice  period  lasted  either  1,  4,  9,  14,  or  24  months.  At  the  conclusion 
of  the  no  practice  period,  subjects  again  completed  the  task.  Fleishman 
and  Parker  found  that  task  performance  during  the  final  set  of  practice 
trials  correlated  very  highly  with  performance  on  the  task  after  the  long 
no-practice  period  (r’s  ranged  from  .80  to  .90).  None  of  the  ability 
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measures  contributed  additional  significant  variance  to  the  prediction  of 
of  task  performance  following  the  no-practice  period.  Fleishman  and  Parker 
therefore  concluded  that  subjects'  level  of  retention  was  based  solely  on 
the  habits  they  had  acquired  during  practice. 

A  final  example  of  Fleishman’s  research  on  the  Impact  of  Individual 
differences  on  learning  phenomena  Is  a  study  by  Fleishman  and  Ellison 
( 1-969 )  on  Interference/transfer  and  massed  vs.  distributed  practice  ef¬ 
fects.  (The  next  part  of  this  section  contains  a  somewhat  more  detailed 
discussion  of  transfer  and  massed  vs.  distributed  practice  effects.)  Sub¬ 
jects  In  this  study  completed  a  battery  of  spatial,  psychomotor,  and  per¬ 
sonality  tests.  They  then  received  both  massed  and  distributed  practice 
trials  on  a  complex  psychomotor  task.  Following  extended  practice,  the 
controls  on  the  task  apparatus  were  shifted.  Subjects  completed  several 
trials  on  this  altered  apparatus.  Finally,  the  apparatus  was  reconfigured 
to  its  original  state  and  subjects  completed  one  last  trial. 

Results  of  the  study  showed  that  a  few  of  the  ability  measures  were 
somewhat  useful  In  predicting  which  subjects  would  experience  transfer  or 
Interference  effects  when  the  apparatus  was  altered.  None  of  the  correla¬ 
tions  was  very  great,  however.  In  addition,  there  was  no  evidence  that  any 
of  the  personality  measures  were  useful  In  predicting  which  subjects  would 
show  "flexibility"  and  transfer.  One  of  the  most  interesting  findings  from 
this  study  concerned  the  prediction  of  performance  on  the  first  trial 
following  a  massed  practice  trial.  On  virtually  every  trial,  multiple  R 
squared  between  task  performance  and  the  ability  measures  was  between  .40 
and  .50.  During  the  trial  Immediately  following  massed  practice,  though, 
multiple  R  squared  dropped  to  .20.  In  their  evaluation  of  this  result, 
Fleishman  and  Ellison  suggested  that  the  ability  requirements  for  task 
performance  during  massed  and  distributed  practice  trials  are  Identical. 
Ability  requirements  appear  to  change  following  massed  practice,  however, 
perhaps  because  of  the  "fatiguelike"  state  Induced  by  massed  practice. 

Variables.. Affecting  Motor  Skills  Learning 

Cognitive,  perceptual,  and  psychomotor  abilities  represent  Individual 
differences  variables  affecting  the  rate  of  acquisition  of  and  ultimate 
level  of  proficiency  in  skilled  motor  performance.  Host  researchers, 
however,  have  focused  on  how  situational  and  motivational  variables  affect 
motor  skills  learning  across  Individuals.  Much  of  this  research  has  cen¬ 
tered  on  five  key  variables:  massed  vs.  distributed  practice,  part  vs. 
whole  task  learning,  stimulus-response  compatibility,  knowledge  of  results, 
and  feedback. 

Massed  vs.  Distributed  Practice.  Massed  practice  refers  to  a  condi¬ 
tion  where  subjects  practice  a  single  task  repeatedly  with  no  Intervening 
periods  of  rest.  The  opposite  Is  distributed  practice,  where  practice 
trials  are  Interrupted  by  rest  periods  or  periods  of  practice  on  other 
tasks. 

In  a  classic  study,  Ammons  (1950)  provided  rest  Intervals  of  zero,  20, 
or  50  seconds;  2,  5,  or  12  minutes;  or  24  hours  between  thirty-six  20- 
second  practice  trials  on  a  rotary  pursuit  task.  Ammons  found  that  the 
time  on  target  scores  of  the  massed  (l.e.,  zero-second  rest  Interval) 
practice  group  were  considerably  lower  than  those  of  the  distributed 
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practice  groups  throughout  all  36  trials.  There  was  no  apparent  relation¬ 
ship  between  the  length  of  the  rest  period  and  time  on  target  scores, 
however,  suggesting  to  Ammons  that  all  forms  of  distributed  practice  are 
equally  superior  to  massed  practice  In  facilitating  motor  skills  learning. 
This  finding  has  been  replicated  in  numerous  studies. 

More  recent  research  has  focused  on  the  persistence  of  massed  vs.  dis¬ 
tributed  practice  effects.  Whitley  (1970)  administered  25  trials  of  a 
tracking  task  to  two  groups  of  subjects.  One  group  received  massed  prac¬ 
tice,  the  other  distributed  practice.  From  the  fifth  trial  through  the 
last,  the  performance  of  the  distributed  practice  group  grew  Increasingly 
superior  to  the  performance  of  the  massed  practice  group.  This  finding 
replicated  the  results  of  much  previous  research.  Whitley  continued  his 
experiment,  however.  Following  the  25th  trial,  he  gave  both  groups  a  5- 
mlnute  rest  period.  Then  he  administered  10  more  trials  to  each  group. 
Ourlng  these  10  trials,  there  was  no  difference  In  the  performance  of  the 
two  groups.  Moreover,  both  groups  performed  at  a  skill  level  comparable  to 
the  pre-rest  skill  level  of  the  distributed  practice  group.  Thus,  the 
performance  of  the  massed  practice  group  showed  a  considerable  Improvement 
from  the  final  pre-rest  trial  to  the  first  post-rest  trial. 

Based  on  these  results,  Singer  (1980)  stated  that  "there  Is  little 
reason  to  doubt  the  superiority  of  distributed  practice  over  massed  prac¬ 
tice  for  Immediate  performance  In  a  variety  of  tasks"  (p.  420).  He  added, 
however,  that  there  Is  no  evidence  that  skill  retention  Is  facilitated  by 
distributed  practice.  The  common  belief  that  distributed  practice  Is 
superior  to  massed  practice  Is  not  clearly  supported  by  currently  available 
data. 


Part  YS,  Whole  Task  Learning.  Many  complex  tasks  can  be  taught  by 
breaking  the  task  Into  subtasks  or  parts  and  then  teaching  the  task  one 
part  at  a  time.  This  Is  known  as  part  task  learning  or  the  part  method  of 
learning.  This  contrasts  with  whole  task  learning,  in  which  the  subject 
attempts  to  learn  to  perform  the  task  as  an  Integrated  whole.  Another 
variant  Is  the  progressive-part  or  continuous-part  procedure,  in  which  the 
subject  practices  the  first  subtask  first,  then  the  first  two  subtasks 
together,  then  the  first  three  subtasks  together,  and  so  forth. 

Psychologists  and  educators  have  long  been  Interested  In  determining 
which  type  of  task  learning  Is  most  conducive  to  skill  acquisition.  Re¬ 
sults  from  this  body  of  research  have  been  mixed,  however,  with  no  clear 
evidence  in  favor  of  either  part  or  whole  task  learning  (Hall,  1982; 

Singer,  1980). 

In  an  effort  to  explain  these  apparently  contradictory  results,  Naylor 
and  Briggs  (1963)  tested  the  hypothesis  that  the  appropriate  method  of 
learning  depends  upon  the  complexity  and  organization  of  the  task.  Task 
complexity  refers  to  the  Information  processing  demands  of  the  task.  Com¬ 
plex  tasks  require  subjects  to  think  a  lot  about  what  they  are  doing,  to 
remember  Information  from  previous  experiences  or  practice  trials,  to  make 
judgments  about  appropriate  methods  for  performing  the  task  given  con¬ 
tinuously  changing  environmental  cues,  and  so  forth.  Task  organization 
refers  to  the  Interrelationships  among  the  various  dimensions  or  subtasks 
comprising  the  task.  Highly  organized  tasks  consist  of  highly  interrelated 
and  Interdependent  subtasks.  A  task  with  low  organization  consists  of 
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several  relatively  Independent  subtasks.  The  Naylor-Brlggs  model  suggests 
that  part  task  learning  Is  superior  when  tasks  are  high  In  complexity  and 
low  In  organization,  while  whole  task  learning  Is  superior  for  tasks  which 
are  low  In  complexity  and  high  In  organization. 

While  the  model  has  some  Intuitive  appeal,  there  are  no  data  to  sup¬ 
port  or  refute  It  at  this  time.  One  problem  with  evaluating  the  model  is 
that  It  Is  often  difficult  to  rate  the  organization  and  complexity  of  a 
task.  For  example,  organization  and  complexity  ratings  for  the  task 
"swinging  a  golf  club"  are  highly  unreliable  (Singer,  1980).  In  addition, 
even  when  experts  do  agree  that  a  task  is  best  taught  In  parts,  they  often 
differ  considerably  on  how  to  divide  the  task  into  parts  for  part  task 
learning.  This  again  makes  comparison  of  part  and  whole  task  learning 
difficult.  Thus,  the  current  state  of  affairs  In  this  area  provides  almost 
no  general  rules  for  determining  when  and  where  part  task  learning  will  be 
superior  or  inferior  to  whole  task  learning. 

Stimulus-Response  Comoatlbll Itv.  One  variable  with  a  tremendous  Im¬ 
pact  on  motor  skills  learning  Is  the  compatibility  between  the  stimulus  and 
the  response.  Stimulus-response  compatibilities  are  derived  from  stereo¬ 
typed  behavioral  response  patterns  shared  by  nearly  everyone  within  a 
population.  For  example,  the  turn  signal  In  an  automobile  operates  In  a 
compatible  manner  (Hall,  1982).  To  signal  a  turn  to  the  right,  one  must 
push  the  turn  signal  clockwise,  to  the  right.  Conversely,  one  must  push 
the  turn  signal  counterclockwise,  to  the  left,  to  signal  a  left  turn.  This 
task  Is  learned  quickly  by  almost  everyone  who  Is  learning  to  drive  a  car. 
It  Is  likely,  however,  that  reversing  the  operation  of  the  signal  (I.e.,  so 
that  a  left  turn  would  be  signaled  by  pushing  right  on  the  turn  signal) 
would  result  In  an  Incompatible  stimulus  and  response  and  therefore  greatly 
Increase  the  difficulty  of  task  learning. 

The  example  above  Illustrates  the  potential  benefits  of  designing 
tasks  so  that  stimuli  and  responses  are  compatible.  Part  of  the  AAF  Avia¬ 
tion  Psychology  Program  during  World  War  II  was  devoted  to  such  research 
(Wlckert,  1947).  Alarmed  by  the  high  loss  of  lives  and  planes  caused  by 
pilots’  spatial  disorientation  during  combat  missions,  the  AAF  directed  the 
Aviation  Psychology  Program  to  investigate  possible  solutions.  Psycholo¬ 
gists  used  the  critical  Incident  technique  to  gather  data  on  disorientation 
experiences  from  pilots  (Flanagan,  1954).  These  data  helped  psychologists 
Identify  a  number  of  stimulus-response  Incompatlblltles  within  the  cockpit 
of  an  airplane.  Psychologists  Involved  in  this  study  were  subsequently 
able  to  issue  a  number  of  recommendations  which  led  to  improvements  in  the 
design  of  cockpits,  controls,  and  instrument  panels  in  combat  airplanes. 

Two  apparatus  have  been  used  extensively  In  laboratory  investigations 
of  stimulus-response  compatibility. 

The  first  Is  a  choice  reaction  time  apparatus  (see  Figure  2).  Sub¬ 
jects  begin  each  trial  by  resting  their  Index  finger  on  the  home  button. 

One  of  the  four  stimulus  lamps  Is  then  lit.  Normally,  a  subject  responds 
to  this  stimulus  by  moving  the  Index  finger  and  depressing  the  button 
adjacent  to  the  lighted  lamp.  When  the  machine  Is  In  an  Incompatible  mode, 
though,  the  assignment  of  stimulus  lamps  to  response  buttons  Is  random. 

That  Is,  the  response  button  assigned  to  a  particular  stimulus  lamp  is  not 
necessarily  the  button  adjacent  to  that  lamp. 
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Figure  2.  Four-choice  reaction  time  apparatus. 


The  second  apparatus  (Hall,  1982)  consists  of  eight  pathways  or  spokes 
emanating  from  a  central  hub  (see  Figure  3).  Adjacent  pathways  are  sepa¬ 
rated  by  45°  angles.  At  the  end  of  each  pathway  are  lamps  that  can  be  used 
as  test  stimuli.  A  digital  clock-like  device  mounted  atop  the  apparatus  can 
also  be  used  to  present  stimuli.  At  the  beginning  of  each  trial,  the 
subject  must  place  a  metal  stylus  in  the  center  of  the  hub.  When  the 
apparatus  is  in  compatible  mode,  the  subject  must  trace  with  the  stylus 
through  the  pathway  in  the  direction  corresponding  to  the  stimulus  pre¬ 
sented.  For  example,  if  the  lamp  at  the  end  of  the  right  horizontal  pathway 
was  lighted  (or  if  "3:00"  was  presented  on  the  clock-like  device),  the 
subject  would  trace  with  his  stylus  across  the  right  horizontal  pathway  as 
quickly  as  possible.  When  the  stimulus  is  in  an  incompatible  mode,  however, 
assignment  of  responses  to  stimuli  is  random.  Thus,  a  display  of  "3:00" 
might  correspond  to  a  response  of  tracing  down  the  lower  vertical  pathway. 

Not  surprisingly,  results  from  both  the  reaction  time  and  the  pathway 
apparatus  show  that  subjects'  reaction  times  are  much  greater  when  the 
stimulus  and  response  are  incompatible.  The  results  from  the  second 
apparatus  are  Interesting,  however,  because  they  Illustrate  how  the  strength 
of  the  compatibility  between  a  stimulus  and  a  response  affects  reaction  time 
when  the  compatibility  is  eliminated.  In  one  study  involving  this  appara¬ 
tus,  Fitts  and  Deininger  (1954)  found  mean  reaction  times  of  .35  second  and 
.64  second  for  the  lamps  and  clock,  respectively,  when  the  apparatus  was  in 
compatible  mode.  These  data  suggest  that  the  compatibility  was  greater  for 
the  lamps  than  for  the  clock.  When  the  machine  was  in  incompatible  mode, 
however,  reaction  times  were  faster  for  the  clock  than  for  the  lamps  (.77 
second  vs.  .96  second).  Thus,  the  data  show  that  interfering  with  a  highly 
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compatible  stimulus-response  relationship  causes  large  decrements  in  skilled 
motor  performance.  The  resulting  incompatible  responses  are  even  more 
difficult  to  learn  than  the  incompatible  responses  resulting  from  tasks 
which,  In  a  compatible  mode,  had  relatively  weaker  stimulus-response 
relationships. 

Knowledge  of  Results.  Knowledge  of  results  (KR)  refers  to  information 
about  the  effectiveness  of  task  performance  received  from  external  sources. 
In  some  cases,  a  subject  receives  KR  by  observing  or  otherwise  sensing 
(e.g.,  smelling,  listening  to)  the  results  of  his  actions.  In  other  in¬ 
stances,  the  subject  obtains  KR  from  other  Individuals  (e.g.,  an  experi¬ 
menter)  . 

Thorndike  (1927)  demonstrated  the  power  of  KR  in  a  very  simple  experi¬ 
ment.  He  blindfolded  subjects  and  asked  them  to  draw  lines  which  were 
either  three,  four,  five,  or  six  inches  Tong.  He  then  provided  some  sub¬ 
jects  with  KR.  That  Is,  when  the  subject’s  line  was  within  one-eighth  inch 
of  the  desired  length,  Thorndike  would  say  "right."  Otherwise,  Thorndike 
would  say  "wrong."  Thorndike  provided  no  KR  to  a  control  group.  Results 
indicated  that  the  group  receiving  KR  became  significantly  more  accurate  in 
their  drawing  during  the  course  of  the  600  trials,  while  the  control  group 
showed  no  Improvement. 

In  subsequent  studies,  researchers  have  varied  a  number  of  KR  parame¬ 
ters  to  determine  their  effects  on  motor  skills  learning.  Results  for  some 
of  the  most  Important  parameters  are  summarized  below. 
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Several  studies  have  shown  that  the  frequency  of  KR  Is  related  to  task 
performance.  In  one  study  (Bilodeau  &  Bilodeau,  1958),  subjects  were  given 
KR  either  every  trial,  every  third  trial,  every  fourth  trial,  or  every 
tenth  trial.  Bilodeau  and  Bilodeau  found  that  Increased  frequency  of  KR 
was  related  to  reductions  In  the  degree  of  task  performance  error. 

Increasing  the  amount  and  precision  of  KR  also  promotes  motor  skills 
learning.  During  a  tracking  task,  Smode  (1958)  provided  one  group  of 
subjects  with  continuous  KR  while  a  second  group  was  told  about  their 
performance  only  after  each  tracking  trial  had  been  completed.  Mean  time 
on  target  was  significantly  greater  for  the  first  group.  In  a  similar 
experiment,  Hardesty  and  Bevan  (1964)  recorded  the  performance  of  four 
groups  on  a  reaction  time  task.  One  group  received  no  feedback  (control), 
a  second  group  was  told  only  that  their  performance  was  "very  good,"  "too 
slow,"  and  so  forth  after  each  trial  (qualitative),  a  third  group  was  told 
their  reaction  time  In  seconds  after  each  trial  (quantitative),  and  a 
fourth  group  received  both  qualitative  and  quantitative  KR  after  each 
trial.  The  qualitative-quantitative  group  had  the  fastest  mean  reaction 
time,  followed  by  the  quantitative  group,  the  qualitative  group,  and  fin¬ 
ally  the  control  group. 

The  effects  of  a  delay  in  KR  have  been  studied  extensively.  Results 
show  that  the  nature  and  timing  of  the  delay  determine  how  the  delay  will 
affect  motor  skills  learning  and  task  performance.  Simply  delaying  KR  for 
a  period  of  several  seconds  has  no  effect  on  performance  (Lorge  & 

Thorndike,  1935).  If,  however,  KR  Is  delayed  so  that  one  or  more  trials 
Intervene  between  the  target  trial  and  KR,  then  task  performance  Is  ad¬ 
versely  affected.  Moreover,  the  number  of  trials  Intervening  between  the 
target  trial  and  KR  Is  positively  related  to  the  degree  of  decrement  In 
task  performance  (Bilodeau,  1956).  The  amount  of  time  elapsing  between  KR 
and  the  onset  of  the  next  trial  also  affects  task  performance.  Weinberg, 
Guy,  and  Tupper  (1964)  investigated  the  effects  of  post-KR  delay  intervals 
of  1,  5,  10,  and  20  seconds.  They  found  that  performance  was  poorest  in 
the  group  which  received  only  a  one-second  KR  delay.  They  found  no  perfor¬ 
mance  differences  among  the  three  remaining  groups.  Weinberg  et  al.  con¬ 
cluded  that  subjects  require  more  than  one  second  to  process  KR  effec¬ 
tively.  With  only  a  one-second  KR  delay,  subjects  are  unable  to  use  KR  to 
Improve  their  performance. 

A  final  parameter  of  KR  which  has  received  considerable  attention  Is 
withdrawal  of  KR.  Newell  (1974)  asked  his  subjects  to  move  a  slide  9.4 
Inches  along  a  track  In  150  milliseconds.  At  the  completion  of  each  trial, 
Newell  asked  his  subjects  to  estimate  their  movement  time.  Newell  used  six 
KR  groups.  One  group  received  KR  on  every  trial.  For  the  other  five 
groups,  KR  was  withdrawn  after  2,  7,  17,  32,  or  52  trials.  For  all  six 
groups,  whenever  KR  was  provided,  It  was  provided  only  after  subjects  had 
estimated  their  movement  time.  Subjects  In  all  six  groups  completed  75 
trials  of  the  movement  time  task.  In  general,  Newell  found  that  movement 
time  tended  to  depart  from  150  milliseconds  after  KR  was  withdrawn.  The 
discrepancy  was  smaller,  however,  for  subjects  who  received  extensive  KR 
before  KR  withdrawal.  Moreover,  the  group  which  received  KR  for  52  trials 
before  KR  withdrawal  showed  no  performance  decrement  following  withdrawal. 
Newell  concluded  that  extensive  KR  creates  Internal  standards  which  allow 
subjects  to  judge  the  accuracy  of  their  performance  without  input  from 
external  sources.  His  analysis  of  subjects’  estimated  movement  times 
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confirmed  this.  Subjects  Mho  received  extensive  KR  gave  very  accurate 
estimates  of  their  movement  times,  even  after  KR  withdrawal,  while  subjects 
who  received  little  KR  gave  very  Inaccurate  estimates  of  their  movement 
times. 

Feedback.  When  motor  skills  researchers  speak  of  feedback,  they  refer 
to  a  variable  that  Is  distinct  from  KR.  KR  denotes  Information  arising 
from  external  sources.  KR  Is  based  on  outcomes  occurring  outside  the 
subject.  Feedback,  on  the  other  hand,  arises  from  sources  within  the 
subject.  Feedback  Is  based  on  proprioceptive  stimulation.  For  example, 
subjects  may  sense  how  far  they  have  moved  an  arm,  how  hard  they  have 
squeezed  a  peg,  or  how  much  they  have  bent  a  wrist.  These  sensations  are 
all  examples  of  proprioceptive  stimulation. 

In  the  next  part  of  this  section,  some  of  the  major  theories  of  motor 
skills  learning  will  be  presented.  The  role  of  feedback  In  motor  skills 
learning  Is  a  major  source  of  disagreement  among  these  theories.  There¬ 
fore,  a  discussion  of  research  results  from  feedback  studies  will  be  post¬ 
poned  and  Integrated  Into  the  presentation  of  the  competing  theories  below. 

Theories  of  Motor  Skills  Learning 

Several  theories  of  motor  skills  learning  have  been  offered  to  explain 
the  course  and  stages  of  motor  skills  acquisition.  These  theories  consist 
primarily  of  descriptions  of  the  processes  occurring  within  an  Individual 
during  motor  skills  learning.  Thus,  the  focus  of  these  theories  has  been 
on  motivational  variables  (e.g.,  KR  and  feedback)  rather  than  on  task  or 
stimulus  variables  (e.g.,  massed  vs.  distributed  practice,  part  vs.  whole 
task  learning,  stimulus-response  compatibility). 

Habit  Theory.  Perhaps  the  oldest  and  simplest  theory  of  motor  skills 
learning  Is  the  theory  that  learning  builds  a  habit  state  In  an  Individual. 
When  the  habit  state  Is  sufficiently  strong,  habit  theory  suggests  that  the 
behavior  will  occur  reliably. 

The  weakest  part  of  habit  theory  is  that  It  fails  to  consider  many 
Interesting  learning  phenomena.  For  example,  habit  theory  states  that  most 
motor  skills  learning  occurs  through  KR.  Yet,  KR  alone  Is  insufficient  to 
explain  why  motor  skills  learning  occurs  In  stages.  KR  provides  no  Infor¬ 
mation  about  the  processes  underlying  these  stages.  Nor  Is  habit  very 
useful  In  explaining  transfer.  Transfer  refers  to  the  fact  that  motor 
skills  learning  that  has  occurred  In  one  situation  can  facilitate  motor 
skills  learning  In  related  situations  (Ammons,  Ammons,  &  Morgan,  1958). 
Habit  suggests  no  mechanisms  for  such  a  phenomenon. 

In  sum,  habit  theory  Is  not  so  much  wrong  as  deficient  theoretically. 
As  a  psychological  construct,  habit  fails  to  provide  Insight  into  the 
mechanisms  and  processes  underlying  motor  skills  learning.  Thus,  motor 
skills  researchers  have  moved  beyond  habit  theory  in  thei -  efforts  to 
describe  and  explain  motor  skills  learning. 

Cybernetic  Theories.  Cybernetics  Is  the  study  of  control  systems  in 
machines  and  animals.  It  Is  concerned  with  how  machines  and  humans  regu¬ 
late  themselves  in  order  to  maintain  Internal  systems  within  standards  of 
performance  (I.e.,  by  ensuring  that  errors  do  not  exceed  limits  of 
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tolerance).  Perhaps  the  most  familiar  example  of  a  cybernetic  control 
system  Is  the  bimetallic  thermostat.  The  metal  In  the  thermostat  acts  as  a 
switch  to  turn  a  furnace  on  and  off.  Heat  and  cold  cause  the  metal  to 
expand  and  contract,  respectively,  regulating  whether  the  furnace  Is  off  or 
on.  No  Intervention  from  outside  sources  Is  required  to  control  the  fur¬ 
nace;  the  thermostat  Is  constructed  to  be  self-regulating. 

The  key  to  any  cybernetic  control  system  Is  feedback  (l.e.,  Informa¬ 
tion  arising  from  within  the  machine  or  organism).  For  this  reason,  cyber¬ 
netic  control  systems  are  often  called  closed-loop  systems.  With  respect 
to  motor  skills,  this  means  performance  Is  evaluated  and  regulated  pri¬ 
marily  on  the  basis  of  Internal  Information,  such  as  proprioceptive  and 
kinesthetic  feedback.  The  cybernetic  control  theory  of  motor  skills 
learning  which  has  received  the  most  attention  Is  Adams'  (1971)  closed-loop 
theory. 

According  to  Adams,  during  the  early  stages  of  motor  skills  learning, 
the  subject  conceptualizes  motor  learning  as  a  problem  to  be  solved.  He 
uses  KR  to  guide  him  In  solving  this  problem.  He  tries  new  responses, 
receives  and  processes  KR,  and  adjusts  his  responses  accordingly.  As  the 
subject  practices  the  task,  he  receives  feedback  from  Internal  sources  as 
well.  This  feedback  might  Include  proprioceptive  stimuli  from  receptors  in 
joints  and  muscles,  tactual  stimuli  from  nerve  cells  In  the  fingers,  and 
kinesthetic  stimuli  based  on  body  position  and  movement.  These  stimuli 
form  a  reference  mechanism  which  Adams  calls  a  perceptual  trace.  As  per¬ 
formance  becomes  more  proficient,  the  subject's  perceptual  trace  becomes 
stronger  and  a  more  valuable  source  of  Information.  During  any  given 
attempt  to  execute  a  motor  response,  a  subject  can  compare  feedback  from 
ongoing  proprioceptive  stimulation  to  performance  standards  embedded  within 
the  perceptual  trace.  This  allows  the  subject  to  regulate  his  performance 
more  efficiently  than  If  he  were  forced  to  wait  and  attend  to  only  KR. 

Thus,  In  the  closed-loop  system,  feedback  gradually  replaces  KR  as  the  key 
regulator  of  skilled  motor  performance. 

Besides  perceptual  trace,  the  other  major  construct  In  Adams’  closed- 
loop  theory  Is  memory  trace.  The  memory  trace  contains  the  Information 
needed  to  Initiate  a  movement.  It  Is  activated  by  appropriate  environ¬ 
mental  cues,  and  then  Initiates  and  specifies  the  speed  and  direction  of 
movement.  The  memory  trace  Is  not  a  rigid  motor  program,  however.  Once 
the  movement  Is  Initiated,  the  perceptual  trace  and  feedback  provide  the 
subject  with  the  means  to  regulate  and  alter  any  movement  Initiated  by  the 
memory  trace. 

Adams’  theory  Is  very  useful  In  explaining  the  transition  from  the 
associative  to  the  automatic  stage  of  motor  skills  learning.  The  theory 
can  also  account  for  the  finding  that  withdrawal  of  KR  has  no  effect  on 
performance  If  the  subject  has  received  extensive  KR  prior  to  KR  withdrawal 
(Newell,  1974).  According  to  Adams,  the  reason  performance  does  not  suffer 
following  KR  withdrawal  is  that  the  subject’s  behavior  has  become  self¬ 
regulating. 

There  Is  some  research  evidence,  though,  which  suggests  that  skilled 
motor  performance  can  occur  In  the  absence  of  any  proprioceptive  feedback. 
In  a  classic  experiment  involving  laboratory  rats,  Lashley  and  Ball  (1929) 
severed  the  nerves  carrying  the  proprioceptive  stimuli  from  muscles  and 
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joints  to  the  motor  centers  of  the  brain  after  the  rats  had  learned  to  run 
a  maze.  Following  the  operation,  the  rats*  motor  coordination  was  somewhat 
Impaired.  Nevertheless,  the  rats*  ability  to  traverse  the  maze  correctly 
was  almost  totally  unaffected.  Skilled  performance  persisted  even  In  the 
absence  of  feedback.  Lashley  (1951)  has  also  pointed  out  that  a  pianist's 
fingers  can  strike  up  to  16  keys  per  second  accurately,  which  appears  to  be 
much  faster  than  a  human  closed-loop  control  system  could  operate  (cf. 

Adams,  1976).  Thus,  many  researchers  lean  towards  a  theory  of  motor  skills 
learning  which  posits  more  centralized  control  of  movement. 

Hierarchical  Control  Theories.  Adams’  closed-loop  theory  stresses  the 
role  of  feedback  In  providing  ongoing  regulation  of  movement.  Other  theor¬ 
ists  have  questioned  whether  humans  actually  regulate  their  movements  to 
the  extent  Adams  has  suggested.  Lashley’s  research  (Lashley,  1951;  Lashley 
&  Ball,  1929)  provides  at  least  two  examples  where  closed-loop  control  of 
movement  Is  either  Impossible  or  highly  Implausible.  Lennenberg’s  (1967) 
finding  that  speech  may  require  up  to  several  hundred  muscle  movements  per 
second  provides  another  example  of  motor  performance  which  would  seem  to 
occur  too  rapidly  to  be  much  affected  by  feedback. 

In  an  effort  to  account  for  these  findings,  several  researchers  have 
offered  open-loop  theories  of  motor  skills  learning.  The  theories  are 
open-loop  In  the  sense  that  they  provide  no  mechanism  for  ongoing  regula¬ 
tion  of  movement.  The  theories  all  suggest  that  once  movement  Is  Initiated 
by  central  processes  It  Is  rarely,  If  ever,  affected  by  feedback.  Thus, 
according  to  these  theories  movement  control  and  regulation  Is  part  of  the 
domain  of  higher-order  processing  functions. 

Keele  (1968)  has  proposed  a  motor  program  theory  of  motor  skills 
learning.  Keele  would  concur  with  Adams  that  KR  and  feedback  produce  both 
perceptual  and  memory  traces.  In  Keele’s  theory,  however,  the  memory  trace 
Is  more  Important  than  the  perceptual  trace  in  motor  movement.  The  memory 
trace  Is  not  simply  a  mechanism  for  Initiating  movement.  Instead,  the 
memory  trace  represents  a  fairly  detailed  specification  of  the  muscle 
movements  required  to  perform  a  particular  task.  Each  motor  program  runs 
for  200  to  400  milliseconds  (Schmidt,  1975).  Once  a  program  is  Initiated, 
it  Is  virtually  impossible  to  alter  movement  until  the  program  runs  its 
course. 

Keele  (1973)  Indicates  that  the  chief  role  of  feedback  in  motor  skills 
learning  Is  In  the  development  of  motor  programs.  Feedback,  along  with  KR, 
helps  to  create  and  perfect  motor  programs.  Once  a  program  is  functioning 
efficiently,  feedback  does  not  affect  the  functioning  of  that  program 
during  motor  movement.  If  new  Information  (l.e.,  from  KR)  suggests  that 
the  program  requires  alteration,  feedback  will  aid  in  modifying  the  pro¬ 
gram.  Otherwise,  the  role  of  feedback  in  motor  skills  learning  is  ended 
once  the  motor  program  Is  functioning  properly. 

Keele’s  theory  is  particularly  applicable  to  brief  movements  lasting 
for  but  a  fraction  of  a  second.  These  movements  are  often  called  ballistic 
acts.  Keele  (1968)  has  theorized  that  complex  movements  can  also  be  under¬ 
stood  In  terms  of  his  motor  program  theory,  however.  According  to  Keele, 
complex  movements  are  nothing  more  than  a  series  of  ballistic  acts.  During  . 
the  course  of  such  movements,  ongoing  visual,  tactile,  auditory,  and  kines¬ 
thetic  stimulation  help  determine  the  timing  and  sequence  of  ballistic 


acts.  In  this  manner,  both  feedback  and  KR  play  a  crucial  role  In  complex 
movement.  Yet,  at  the  most  basic  level,  the  ballistic  acts  themselves  are 
under  the  exclusive  control  of  motor  programs  and  cannot  be  altered  during 
the  course  of  movement  execution. 

Evidence  In  favor  of  the  motor  prog  am  theory  comes  primarily  from  two 
sources.  Evidence  from  research  with  human  subjects  cited  above  (e.g., 
Lennenberg,  1967)  suggests  that  some  movement  occurs  too  rapidly  to  be 
under  continuous,  ongoing  regulation.  Adams  (1976),  however,  has  presented 
evidence  from  physiological  studies  which  indicates  that  Information  can  be 
processed  through  feedback  loops  In  as  little  as  10  milliseconds.  Thus, 
this  first  source  of  evidence  may  not  be  as  powerful  as  once  thought.  The 
second  source  of  evidence  comes  from  animal  studies  (e.g.,  Delcomyn,  198C- 
Lashley  &  Ball,  1929;  Wilson,  1961).  Results  from  these  studies  suggest 
that  elimination  of  proprioceptive  stimulation  is  not  accompanied  by  degra¬ 
dation  of  performance.  Closed-loop  theorists  have  questioned  the  applica¬ 
bility  of  these  results  for  humans,  however. 

Adams,  Goetz,  and  Marshall  (1972)  have  conducted  one  of  the  only  tests 
of  the  competing  predictions  of  closed-loop  and  open-loop  theories.  During 
the  course  of  motor  skills  learning,  Adams  et  al .  changed  the  conditions  of 
feedback.  For  one  group,  the  feedback  change  occurred  early  In  practice. 
For  a  second  group,  the  change  occurred  late  In  practice.  According  to 
Adams  et  al.,  closed-loop  theory  predicts  that  feedback  change  would  have 
Its  greatest  Impact  late  In  practice,  when  performance  Is  being  closely 
regulated  by  feedback.  Open-loop  theory  predicts  that  feedback  change 
would  have  little  effect  on  performance  late  In  practice  because  movement 
during  this  stage  Is  controlled  primarily  by  motor  programs.  Adams  et  al . 
found  that  performance  degradation  was  much  greater  when  the  feedback 
occurred  late  In  practice,  providing  support  for  Adams’  closed-loop  theory. 

Information  Processing  Theories.  The  most  common  application  of  In¬ 
formation  processing  theories  has  been  to  provide  models  for  the  investiga¬ 
tion  of  basic  cognitive  processes  such  as  attention,  perception,  encoding, 
storage,  retrieval,  and  decision  making.  As  the  name  Implies,  Information 
processing  research  concerns  the  manner  In  which  Individuals  attend  to  and 
process  sensory  stimuli. 

Most  Information  processing  research  has  been  concerned  with  deter¬ 
mining  the  limits  of  human  cognitive  processing  capabilities.  Cralk  and 
Lockhart  (1972),  for  example,  Investigated  limitations  In  the  ability  to 
store  Information  in  short-term  storage.  Broadbent  (1958)  described  limits 
In  the  ability  to  attend  to  two  or  more  stimuli  simultaneously. 

Figure  4  presents  an  information  processing  model  devised  by  Welford 
(1968).  Welford’s  model  is  particularly  Interesting  for  two  reasons. 

First,  Welford’s  model  Illustrates  clearly  the  measurement  dilemma 
faced  by  information  processing  researchers.  In  a  typical  information 
processing  experiment,  researchers  present  an  "external  object"  to  the 
subject.  Subsequently,  they  measure  some  overt  response  produced  by  an 
"effector."  These  overt  responses  are  the  only  measurable  output  avail¬ 
able.  From  these  responses,  Information  processing  researchers  attempt  to 
Infer  the  nature  of  the  cognitive  processes  occurring  within  the  subject. 
Welford’s  model  In  Figure  4  represents  a  rather  simple  information 
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Figure  4.  An  information  processing  model  of  motor  skills  learning.  From 
Fundamentals  of  skill  by  A.  T.  Welford,  1968,  London:  Methuen.  Copyright 
1968  by  Methuen.  Reprinted  by  permission. 
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processing  model --and  It  Includes  five  processes  intervening  between  the 
stimulus  and  the  response.  None  of  these  processes  are  measurable.  Thus, 
experimental  tests  of  Information  processing  hypotheses  often  must  rely  on 
measurement  of  phenomena  occurring  several  steps  removed  from  the  target 
process.  This  frequently  makes  it  difficult  to  test  information  processing 
hypotheses  adequately. 

Second,  Welford’s  theory  represents  one  of  the  few  attempts  to  link 
motor  skills  learning  to  Information  processing  theory.  The  processes 
Wei  ford  calls  "choice  of  response"  and  "control  of  response"  are  especially 
relevant  to  skilled  performance. 

Welford’s  information  processing  theory  is  primarily  descriptive, 
especially  with  respect  to  the  processes  involved  in  selecting  and  exe¬ 
cuting  motor  responses.  Thus,  It  is  difficult  to  compare  h^s  model  di¬ 
rectly  with  Adams’  or  Keele’s.  Welford  nas  been  particularly  interested  in 
Investigating  how  attentlonal  limitations  affect  motor  response  time.  For 
example,  he  has  described  how  responses  to  a  signal  are  delayed  If  the 
signal  Is  presented  immediately  following  another  signal  (Welford,  1976). 
Based  on  this  and  other  research,  Welford  has  concluded  that  central  pro¬ 
cessing  mechanisms  must  be  heavily  Involved  In  motor  responses.  When  these 
mechanisms  are  attending  to  other  stimuli,  as  in  the  example  above,  re¬ 
sponse  time  to  motor  movement  cues  Is  lengthened  considerably. 

An  Integrative  Model  of  Motor  Skills  Learning.  Singer  (1980)  has 
noted  that  cybernetic,  hierarchical  control,  and  Information  processing 
theories  tend  to  focus  on  different  aspects  of  the  motor  skills  learning 
process  and  thus  are  more  complementary  than  Incompatible.  He  has  devel¬ 
oped  an  integrative  model  of  motor  skills  learning  that  borrows  elements 
from  all  three  theories. 

Information  processing  theory  is  primarily  concerned  with  the  cognitive 
processing  that  occurs  during  skilled  performance.  Welford's  (1968,  1976) 
work  has  demonstrated  how  attentional  limits  affect  motor  response  time  and 
performance.  During  the  cognitive  stage  of  motor  skills  acquisition,  for 
example,  subjects  must  pay  attention  to  the  task  at  hand  in  order  to  detect 
key  features  of  the  task  that  might  be  helpful  in  determining  an  appropriate 
motor  response  strategy.  Also  during  this  stage,  attention  is  vital  to  the 
storage  of  task  information  in  long-term  memory.  Transferring  task  infor¬ 
mation  from  short  term  memory  to  long  te*m  memory  can  only  occur  as  the 
subject  rehearses  (i.e.,  attends  to)  the  task  information  (Craik  &  Lockhart, 
1972).  Even  after  motor  skills  learning  reaches  the  automatic  stage, 
attention  continues  to  play  a  key  role  in  motor  performance.  As  Welford 
(1976)  showed,  if  the  subject  fails  to  attend  to  situational  information  or 
is  in  the  process  of  responding  to  other  cues  and  signals  when  new  situa¬ 
tional  information  is  presented,  motor  responses  either  will  not  occur  or 
will  be  delayed  significantly. 

Adams’  (1971)  cybernetic  closed-loop  theory  is  useful  in  describing 
the  increasing  importance  of  feedback  and  the  perceptual  trace  as  motor 
skills  acquisition  passes  through  the  associative  and  automatic  stages. 
Adams’  theory  also  explains  the  important  role  KR  plays  in  the  learning 
process.  Research  by  Adams  et  al .  (1972)  demonstrates  that,  at  least  under 
certain  conditions,  feedback  continues  to  function  as  a  regulator  of  task 
performance  even  after  a  task  has  been  mastered  by  the  subject. 
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Keele’s  (1968)  hierarchical  control  theory  explains  most  motor  func¬ 
tioning  during  the  automatic  stage  of  motor  skills  acquisition.  During 
this  stage,  there  appears  to  be  very  little  regulation  of  performance; 
Instead,  most  motor  movement  appears  to  be  pre-programmed.  Information 
processing  Is  still  Important  during  this  learning  stage,  however,  because 
the  subject  must  determine  the  proper  sequence  and  timing  to  execute  these 
motor  programs. 

Singer's  (1980)  integrative  model  suggests  a  number  of  fruitful  areas 
for  future  research.  Much  more  research  is  needed  to  describe  the  role  of 
attention  and  decision  making  during  the  cognitive  stage  of  motor  skills 
acquisition.  A  description  of  the  cognitive  processes  involved  in  se¬ 
quencing  and  timing  the  ballistic  acts  comprising  complex  movements  is 
another  area  ripe  for  research.  Of  course,  the  roles  of  KR  and  feedback 
during  the  automatic  phase  of  motor  skills  acquisition  continue  to  be  hotly 
debated.  Singer's  model  indicates  that  these  also  deserve  consideration  in 
future  research. 

Implications  of  Motor  Skills  Learning  Theories 
for  Psvchomotor  Ability  Research 

In  the  preceding  section,  It  was  noted  that  a  major  problem  with 
Fleishman’s  ability  taxonomy  is  that  the  abilities  have  never  been  de¬ 
scribed  and  explicated  thoroughly.  In  developing  definitions  of  these 
abilities,  Fleishman  attempted  to  judge  the  common  ability  demands  of  the 
tests  loading  on  each  ability  factor.  By  1962,  Fleishman  had  conducted 
sufficient  factor  analytic  research  to  allow  him  to  specify  a  comprehensive 
psychomotor  ability  taxonomy.  Since  that  time,  however,  very  little  re¬ 
search  has  been  directed  at  developing  a  nomologlcal  net  for  these  abili¬ 
ties. 


Recognizing  this  problem,  Imhoff  and  Levine  (1981)  showed  how  findings 
from  research  on  motor  skills  acquisition  could  be  used  to  identify  and 
explain  the  relationships  among  Fleishman’s  abilities.  For  example,  they 
noted  that  the  type  of  motor  performance  described  by  Keele’s  (1968)  open- 
loop  theory  was  similar  to  the  types  of  tasks  performed  in  measures  of 
control  precision,  reaction  time,  and  speed  of  arm  movement.  This  implies 
that  two  of  the  most  Important  processes  involved  In  these  abilities  are 
detection  (l.e.,  to  determine  when  to  Initiate  a  movement)  and  movement 
speed  (l.e.,  to  permit  rapid  responding).  Keele’s  description  also  sug¬ 
gests  some  processes  that  would  be  relatively  unimportant  to  these  abili¬ 
ties.  These  include  monitoring  feedback  and  KR,  since  the  movements  in¬ 
volved  In  measures  of  these  abilities  are  under  the  control  of  motor  pro¬ 
grams  . 

Previously,  it  was  noted  that  Imhoff  and  Levine  built  their  psycho¬ 
motor  taxonomy  around  the  role  of  feedback  in  task  performance.  Singer's 
integrated  model  suggests  a  number  of  other  processes  and  parameters  that 
might  be  considered  when  describing  Fleishman's  abilities.  These  include: 

1.  The  role  of  situational  information  in  initiating  movement. 


2.  The  difficulty  of  detecting  situational  information  (e.g.,  due  to 
weak  or  competing  stimuli). 

3.  The  attentlonal  demands  of  task  learning. 

4.  The  attentlonal  demands  of  task  performance  when  motor  skills 
acquisition  has  reached  the  automatic  stage. 

5.  Sources  of  KR  and  feedback. 

Imhoff  and  Levine’s  review  reflects  their  expert  judgment  of  the 
relationship  between  motor  movement  processes  and  parameters  and 
Fleishman’s  psychomotor  abilities.  Empirical  research  Is  needed  to  supple¬ 
ment  and  validate  these  judgments.  Such  research  could  be  modeled  after 
the  cognitive  correlates,  cognitive  components,  cognitive  training,  and 
cognitive  contents  research  paradigms  described  by  Sternberg  (1981).  Even¬ 
tually,  such  research  might  Indicate  improved  ways  of  categorizing  psycho¬ 
motor  abilities.  'Certainly  it  would  Improve  our  understanding  of  how  these 
abilities  operate. 


46 


SECTION  IV 


THE  USE  OF  PSYCHOMOTOR  TESTS  IN  APPLIED  PSYCHOLOGICAL  RESEARCH 

For  both  legal  and  pragmatic  reasons,  organizations  using  selection 
tests  must  be  concerned  with  the  validity  of  those  tests.  The  most  widely 
accepted  method  of  establishing  the  validity  of  a  selection  test  involves 
demonstrating  that  test  scores  are  related  to  one  or  more  indices  of  job 
performance.  This  is  known  as  criterion-related  or  empirical  validity. 
According  to  Kleiman  and  Faley  (1978),  since  the  passage  of  the  1964  Civil 
Rights  Act,  the  courts  have  tended  to  rely  almost  totally  on  criterion- 
related  validity  evidence  in  determining  the  validity  of  a  selection  test 
or  procedure.  In  response  to  the  courts'  position,  industrial  psycholo¬ 
gists  have  also  focused  their  validation  efforts  on  establishing  the  cri¬ 
terion-related  validity  of  their  selection  tests.  The  criterion-related 
validity  evidence  for  psychomotor  tests  will  be  evaluated  in  a  subsequent 
part  of  this  section. 

Kleiman  and  Faley,  however,  argued  that  the  current  emphasis  on 
criterion-related  validity  Is  erroneous.  Because  of  small  sample  sizes  and 
Inadequate  personnel  research  staffs  and  budgets,  many  organizations  lack 
the  resources  to  conduct  quality  criterion-related  validity  research.  In 
such  1nstances--and  even  In  Instances  where  criterion-related  validity 
research  Is  economically  feasib1e--Kleiman  and  Faley  suggested  that  the 
validity  of  selection  tests  can  be  established  via  content  validation. 

Content  Valuation  or  Tests 

Content  validity  Is  established  by  demonstrating  that  a  test  consti¬ 
tutes  a  representative  sample  of  the  behaviors  comprising  the  domain  to  be 
measured  (Anastasl,  1976;  Helmstadter,  1964).  With  respect  to  selection 
testing,  this  means  that  one  must  demonstrate  that  the  content  of  a  test  is 
a  representative  sample  of  the  behaviors  comprising  the  job  performance 
domain.  Generally,  the  content  validity  of  a  selection  test  Is  established 
by  conducting  a  thorough  job  analysis,  extracting  situations  from  the  job 
performance  domain  for  use  as  test  Items  or  stimuli,  assessing  subjects’ 
responses  to  those  situations,  and  evaluating  those  responses  against  known 
standards  of  job  performance. 

Many  psychologists  have  noted  that  content  validity  alone  does  not 
ensure  that  a  test  will  be  valid  (l.e.,  that  a  test  will  do  what  It  is 
Intended  to  do)  (Casclo,  1982;  Gulon,  1977).  This  is  particularly  applic¬ 
able  for  selection  tests,  whose  primary  or  sole  function  is  to  predict  how 
well  job  applicants  will  perform  on  the  job. 

Nevertheless,  the  principles  of  and  procedures  for  content  validation 
can  be  useful  In  guiding  test  development.  Guion  (1977),  for  example, 
argued  that  content  validity  helps  ensure  that  test  items  reflect  an  appro¬ 
priate  operational  definition  of  the  constructs  or  behaviors  comprising  the 
criterion  domain.  Wernimont  and  Campbell  (1968)  distinguished  between 
tests  as  signs  and  tests  as  samples  of  behavior.  In  their  view,  tests 
consisting  of  job  samples  and  work  samples  are  more  directly  related  to  the 
criterion  of  interest  (l.e.,  they  are  more  content  valid)  than  tests  that 
are  signs  or  indirect  measures  of  the  criterion.  Because  of  this  direct 
relationship,  Wernimont  and  Campbell  predicted  that  the  criterion-related 
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validity  of  "samples"  tests  should  generally  be  greater  than  that  of 
"signs"  tests. 

In  sum,  content  validity  alone  does  not  ensure  that  a  test  will  be 
useful  for  predicting  job  performance.  It  does  seem  reasonable,  however, 
to  expect  that,  on  the  average,  content  valid  tests  will  have  greater 
criterion-related  validity  than  tests  which  lack  content  validity. 

Anastasl  (1976)  has  stated  that  psychomotor  tests  are  content  valid  if 
they  require  the  subject  to  "closely  reproduce  all  or  part  of  the  movements 
required  In  the  performance  of  the  job  Itself. ... (T)o  ensure  validity,  the 
test  and  the  job  should  call  for  the  use  of  the  same  muscle  groups" 

(p.  444). 

The  Identification  of  the  movements  and  motor  skills  required  for 
successful  job  performance  Is  dependent  on  careful  job  analysis.  As  noted 
above,  job  analysis  should  always  be  the  first  step  In  the  content  valida¬ 
tion  process  for  selection  tests  (Equal  Employment  Opportunity  Coordinating 
Council,  1978). 

The  AAF  aircrew  selection  research  that  was  described  briefly  in  the 
taxonomy  section  represents  the  first  systematic  effort  to  apply  the  prin¬ 
ciples  of  content  validation  to  the  development  of  psychomotor  tests. 

While  their  job  analysis  (Miller,  1947)  was  crude  by  current  standards,  It 
did  serve  to  focus  the  psychomotor  test  development  efforts  of  Melton 
(1947)  and  his  colleagues. 

Pilots  were  the  target  of  the  first  AAF  job  analysis.  An  early  vari¬ 
ant  on  Flanagan’s  (1954)  critical  Incident  technique  was  used  to  collect 
job  analysis  data.  Because  the  goal  of  this  study  was  to  predict  attrition 
from  pilot  training,  job  analysis  data  collection  focused  on  pilot  training 
failures. 

AAF  procedures  during  World  War  II  required  check-pilots  (l.e.,  pilot 
training  instructors)  to  appear  before  a  faculty  board  to  justify  their 
decisions  regarding  the  elimination  of  cadets  from  pilot  training  (Melton, 
1947;  Miller,  1947).  At  these  proceedings,  the  check-pilot  informed  the 
student  and  the  board  members  of  the  reasons  for  recommending  that  the 
cadet  be  withheld  from  further  training.  Usually  the  check-pilot  made 
reference  to  characteristics  of  the  cadet  and  generated  specific  examples 
of  problems  the  cadet  was  experiencing  in  learning  how  to  fly.  The  check- 
pilot’s  statements  and  the  board’s  comments  and  recommendations  were  re¬ 
corded  In  the  Faculty  Board  Proceedings. 

In  1941,  researchers  began  to  analyze  the  Faculty  Board  Proceedings  to 
determine  common  reasons  for  pilot  training  failure.  Initially,  proceed¬ 
ings  were  reviewed  for  300  cadets  eliminated  from  elementary  pilot  train¬ 
ing.  In  1942,  a  second  analysis  was  conducted  based  on  1,000  additional 
cadets  who  had  also  been  eliminated  from  elementary  pilot  training.  Later, 
the  analysis  was  expanded  to  Include  100  cadets  who  had  been  eliminated 
from  advanced  single  engine  pilot  training  and  100  cadets  who  had  been 
eliminated  from  advanced  twin  engine  pilot  training.  Based  largely  on  the 
analysis  of  the  sample  of  1,000  cadets  who  had  been  eliminated  from  elemen¬ 
tary  flight  training,  a  list  of  20  common  reasons  for  pilot  training  fail¬ 
ure  was  compiled.  These  20  reasons  were  grouped  into  four  major 
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categories:  Intelligence  and  judgment,  alertness  and  observation,  coordina¬ 
tion  and  technique,  and  personality  and  temperament. 

Of  particular  Interest  to  those  Involved  In  psychomotor  test  develop¬ 
ment  was  the  third  category,  coordination  and  technique.  This  category 
subsumed  five  reasons  for  pilot  training  failure:  coordination,  appropri¬ 
ateness  of  controls  used,  feel  of  the  controls,  smoothness  of  control 
movement,  and  progress  In  developing  technique.  Table  4  provides  more 
complete  explanations  of  these  five  reasons  for  pilot  training  failure. 
Table  4  also  shows  the  percentage  of  cadets  whose  elimination  was  based  at 
least  In  part  on  each  of  these  reasons.  Of  the  four  major  categories, 
coordination  and  technique  was  the  most  commonly  cited  failure  category  for 
cadets  in  elementary  pilot  training  and  the  second  most  commonly  cited 
failure  category  for  advanced  single  and  twin  engine  pilot  training. 
Therefore,  the  AAF  research  team  decided  to  emphasize  the  development  of 
tests  which  would  reliably  assess  Individual  differences  In  coordination 
and  technique. 

From  the  beginning,  the  AAF  research  team  felt  that  It  would  be  neces¬ 
sary  to  use  apparatus  tests  to  measure  coordination  effectively.  There¬ 
fore,  they  Invested  much  of  ^helr  time  In  designing  new  apparatus  tests 
which  were  similar  In  appearance  and  psychomotor  demands  to  apparatus  and 
Instruments  In  the  cockpit  of  an  airplane.  Among  the  tests  developed  by 
this  team  during  World  War  II  were  the  Two-Hand  Coordination  Test,  the  Two- 
Hand  Pursuit  Test,  the  Rudder  Control  Test,  and  the  Arm-Hand  Steadiness 
Test.  In  addition,  these  researchers  modified  and  Improved  a  number  of 
tests  which  had  previously  been  used  In  pilot  selection  research,  Including 
the  Complex  Coordination  Test  and  the  Rotary  Pursuit  Test.  These  tests  all 
are  described  In  Appendix  A. 

By  maintaining  fidelity  between  the  tests  and  the  job  performance 
domain,  the  AAF  researchers  guaranteed  the  content  validity  of  their  psy¬ 
chomotor  tests.  Their  efforts  to  validate  these  tests  did  not  cease  with 
content  validity,  however.  They  also  evaluated  the  criterion-related 
validity  of  each  test  before  Including  it  In  the  psychomotor  portion  of  the 
ACB.  Moreover,  the  research  team  was  careful  to  Include  new  psychomotor 
tests  In  the  ACB  only  If  they  tapped  variance  in  pilot  training  attrition 
which  had  not  been  tapped  by  other  psychomotor  tests.  This  helped  ensure 
coverage  of  a  broad,  representative  sample  of  the  criterion  domain.  Valid¬ 
ity  analyses  for  the  ACB  Indicated  that  the  psychomotor  apparatus  tests 
were  tapping  unique  variance  In  the  prediction  of  pilot  training  attrition 
--variance  that  was  not  tapped  by  traditional  paper-and-pencil  cognitive 
and  perceptual  tests. 

Another  example  of  content  validation  In  psychomotor  test  development 
Is  provided  by  recent  research  into  the  prediction  of  tank  crew  performance 
(Campbell  &  Black,  1982;  Eaton,  Johnson,  &  Black,  1980).  Based  on  their 
analyses  of  tank  crew  jobs,  these  researchers  identified  several  tasks  that 
are  crucial  to  tank  crew  performance.  Th«y  then  developed  a  series  of  work 
sample  tests  using  these  key  tasks  as  guioes.  Thus,  their  tests  represent 
content  valid  "samples"  rather  than  "signs"  of  criterion  behavior.  Like 
the  AAF  research  team,  these  researchers  followed  up  their  test  development 
efforts  by  collecting  criterion-related  validity  data.  Analyses  of  these 
data  Indicated  that  several  of  the  psychomotor  work  sample  tests  correlated 
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Percentage  of  PUot  Training  Failures  Attributable  in  Whole  or  in  Part 
to  Problems  Involving  Coordination  and  Technique 
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significantly  with  training  performance  measures,  especially  for  the  tank 
gunner  Job. 

These  two  studies,  along  with  many  others  (e.g.,  Eggenberger,  1976; 
Fowler,  1981;  Jones,  1982),  demonstrate  how  measures  developed  using  a 
content  validation  strategy  can  be  useful  In  predicting  training  perfor¬ 
mance  and  job  proficiency.  It  must  be  recognized,  however,  that  content 
validity  alone  does  not  ensure  that  a  selection  test  will  be  a  valid 
predictor  of  job  performance.  Many  selection  researchers,  Including  Cascio 
(1982)  and  Gulon  (1977),  have  advised  that  content  validity  Is  best  viewed 
as  a  test  development  strategy  rather  than  as  a  test  validation  strategy 
(cf.  Klelman  &  Faley,  1978).  In  fact,  Gulon  (1978)  has  even  suggested  that 
a  more  appropriate  name  for  content  validity  would  be  content-oriented  test 
development. 

Thus,  under  no  circumstances  should  content  validity  be  regarded  as  a 
substitute  for  criterion-related  validity.  No  matter  how  careful  re¬ 
searchers  are  In  designing  job  selection  tests,  there  Is  no  guarantee  that 
these  tests  will  be  valid  as  predictors  of  job  performance  criteria. 
Therefore,  criterion-related  validity  data  should  be  collected  and  eval¬ 
uated  whenever  possible.  Indeed,  the  ultimate  goal  of  any  validation  study 
should  be  collection  of  criterion  job  performance  measures  and  calculation 
of  the  correlation  between  these  measures  and  selection  test  scores. 

As  an  Interim  step  before  criterion-related  validity  can  be  estimated, 
researchers  can  be  guided  In  their  choice  of  selection  tests  by  previous 
criterion-related  validity  research.  Recent  Investigations  of  validity 
generalization  by  Ounnette  et  al .  (1982)  and  Pearlman,  Schmidt,  and  Hunter 
(1980)  suggest  that,  for  any  given  job  family,  correlations  between  selec¬ 
tion  tests  and  Job  performance  measures  vary  only  slightly  from  job  to  job 
and  from  organization  to  organization.  As  an  aid  In  determining  which 
selection  tests  are  likely  to  be  valid  predictors  of  performance  for  par¬ 
ticular  jobs,  a  summary  of  previous  validity  research  Is  provided  next. 

The  Criterion-Related  Validity  Literature  Review  Process 

Recently,  Bullock  and  Svytanek  (1985)  showed  that  the  results  of  a 
meta-analysis  or  validity  generalization  study  can  be  greatly  influenced  by 
the  manner  In  which  a  researcher  chooses  to  record  the  characteristics  and 
results  of  studies  under  review.  Therefore,  a  thorough  description  of  the 
methods  used  to  identify  the  literature  on  the  criterion-related  validity 
of  psychomotor  tests  and  to  summarize  the  criterion-related  validity  coef¬ 
ficients  reported  for  the  psychomotor  domain  Is  provided  below. 

Identifying  Relevant  Articles.  Reports,  and  Manuals.  In  the  Preface 
to  this  report,  procedures  used  to  Identify  the  literature  relevant  to  this 
domain  were  described.  Initially,  the  review  of  criterion-related  validity 
studies  focused  on  research  conducted  within  the  past  15  years  (i.e.,  prior 
to  1970).  As  the  sources  Identified  via  the  computerized  literature  search 
were  reviewed,  however,  It  became  apparent  that  very  little  criterion- 
related  validity  research  had  been  conducted  in  the  psychomotor  area  during 
that  period.  Indeed,  with  the  exception  of  several  recent  military  re¬ 
search  studies  Investigating  the  validity  of  psychomotor  tests  as  pre¬ 
dictors  of  pilot  performance  (e.g.,  Hunter  &  Thompson,  1978;  Imhoff  & 
Levine,  1981;  Myers,  Jennings,  Schemmer,  and  Fleishman,  1982)  and  some 
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ongoing  validity  research  with  the  General  Aptitude  Test  Battery  (GATB)  (U. 
S.  Department  of  Labor,  1970),  there  have  been  very,  very  few  research 
programs  devoted  to  an  investigation  of  the  criterion-related  validity  of 
psychomotor  tests.  Moreover,  of  the  military  research  programs  listed 
above,  those  Initiated  by  Imhoff  and  Levine  for  the  Air  Force  Human  Re¬ 
sources  Laboratory  (AFHRL)  and  Myers  et  al .  for  the  Army  Research  Institute 
(ARI)  have  yet  to  yield  any  validity  results.  Thus,  excluding  the  GATB,  it 
was  only  possible  to  accumulate  approximately  300  validity  coefficients 
from  the  studies  Initially  reviewed. 

Consequently,  the  literature  search  was  expanded  to  include  validity 
research  published  during  the  1940s,  1950s,  and  1960s.  This  made  It  pos¬ 
sible  to  Incorporate  several  additional  major  sources  of  psychomotor  valid¬ 
ity  data  Into  the  review,  including  pre-1970  GATB  validity  results  and 
validity  data  from  the  AAF  selection  research  on  pilots,  navigators,  and 
bombardiers  from  World  War  II  (Melton,  1947). 

In  summarizing  all  of  the  articles,  reports,  and  manuals  reviewed  for 
the  psychomotor  domain,  the  research  staff  completed  180  article  review 
forms  and  422  predictor  review  forms.  Not  all  of  the  papers  reviewed 
contained  criterion-related  validity  information,  however;  only  35  arti¬ 
cles,  reports,  and  manuals  reported  criterion-related  validity  coeffi¬ 
cients.  (References  for  these  35  studies  are  provided  In  Appendix  B.)  In 
total,  these  35  studies  yielded  2,373  validity  coefficients  based  on  test 
scores  from  75  different  psychomotor  apparatus  and  paper-and-pencll  tests. 
By  far  the  largest  set  of  these  coefficients  (AM, 734,  or  73.1%)  were 
obtained  from  the  five  GATB  psychomotor  scales.  An  additional  350  coeffi¬ 
cients  (14.7%)  were  obtained  from  the  Army  Air  Forces  aircrew  selection 
research  conducted  during  World  War  II  (Melton,  1947).  These  350  coeffi¬ 
cients  represent  research  results  for  32  different  psychomotor  tests. 

Tabulation  of  Validity  Coefficients.  During  the  review,  it  was  noted 
that  different  authors  tended  to  report  validity  results  differently.  For 
example,  some  researchers  who  Investigated  several  different  samples  of 
subjects  reported  validity  coefficients  for  each  sample  while  others  simply 
reported  a  mean  coefficient  across  samples.  Some  attempted  to  correlate 
psychomotor  test  scores  with  rather  global  criteria  (e.g.,  supervisory 
ratings  of  overall  job  performance)  while  others  reported  separate  validity 
coefficients  for  each  of  several  more  specific  criteria  (e.g.,  performance 
on  several  different  work  sample  tests).  Finally,  for  tests  Involving 
multiple  trials,  some  researchers  reported  separate  validity  coefficients 
for  each  trial  while  others  computed  a  total  score  across  trials  and  based 
their  validity  coefficient(s)  on  this  total  score. 

To  ensure  consistency  In  the  tabulation  of  validity  coefficients,  the 
following  decision  rules  were  adopted: 

1.  If  validity  coefficients  were  reported  for  more  than  one 
sample  of  subjects,  separate  validity  coefficients  were 
recorded  for  each  sample. 

2.  If  validity  coefficients  were  provided  for  more  than  one 
criterion  variable: 
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a.  If  there  was  a  total  or  summary  score,  only  the  validity 
coefficient (s)  for  that  summary  score  was  recorded. 

b.  If  there  was  no  total  or  summary  score,  separate  validity 
coefficients  were  recorded  for  each  criterion  variable. 

3.  If  separate  validity  coefficients  were  recorded  for  each 
trial  (or  for  each  block  of  trials)  on  the  psychomotor  test: 

a.  If  there  was  a  total  or  summary  score,  only  the  validity 
coefficient(s)  for  that  summary  score  was  recorded. 

b.  If  there  was  no  total  or  summary  score,  the  mean  validity 
coefflclent(s)  across  trials  was  computed  and  recorded. 

Special  forms  were  then  prepared  to  facilitate  the  tabulation  of 
validity  data.  The  following  Information  was  recorded  for  each  validity 
assessment: 

1.  Predictor  construct 

2.  Criterion  construct 

3.  Validity  coefficient 

4.  Sample  size 

5.  Research  setting 

6.  Job  type 

7.  Pilots  vs.  non-pilots 

8.  Test 

9.  Reviewer 

10.  Article  review  form  number 

11.  Predictor  review  form  number 

The  predictor  construct  refers  to  the  psychomotor  ability  tapped  by 
the  predictor  test  used  In  the  validity  assessment.  Each  test  was  assigned 
to  one  of  10  different  predictor  constructs.  Nine  of  the  constructs  were 
taken  from  Fleishman’s  taxonomy  of  the  psychomotor  ability  domain:  multi  - 
limb  coordination,  control  precision,  rate  control,  finger  dexterity,  man¬ 
ual  dexterity,  wrist-finger  speed,  aiming,  arm-hand  steadiness,  and  speed 
of  arm  movement.  As  noted  In  the  section  reviewing  the  taxonomy  of  the 
psychomotor  domain,  except  for  reaction  time  and  response  orientation  this 
list  Includes  all  of  the  abilities  Fleishman  Included  in  his  taxonomy  of 
the  psychomotor  domain.  Table  2  in  the  taxonomy  section  includes  a  defini¬ 
tion  of  each  of  these  nine  abilities.  The  tenth  predictor  construct, 
complex  psychomotor  predictors,  includes  all  tests  which  significantly  tap 
two  or  more  of  the  nine  abilities  listed  above.  The  tests  categorized  into 
this  predictor  construct  often  Involved  complex  tasks  such  as  video  games 
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(e.g.,  Jones,  Kennedy,  &  Bittner,  1981)  or  job  simulations  (e.g.,  Eaton, 
Johnson,  &  Black,  1980). 

The  criterion  construct  refers  to  the  type  of  criterion  measure  that 
was  used  to  validate  the  predictor  construct.  Criteria  were  initially 
categorized  Into  one  of  four  major  categories:  educational  and  school 
achievement,  training  performance,  job  proficiency,  and  job  involvement/ 
withdrawal.  Each  of  these  major  categories  was  In  turn  divided  into  2-6 
criterion  constructs,  resulting  in  12  different  criterion  constructs.  Def¬ 
initions  and  explanations  of  these  constructs  are  provided  In  Table  5. 

for  research  setting,  each  study  was  categorized  as  either  military  or 
non-military.  For  the  psychomotor  domain,  this  distinction  was  based 
entirely  on  whether  the  subject  population  was  military  or  civilian. 

The  job  type  classification  scheme  was  derived  from  the  Dictionary  of 
Occupational  Titles  {DOT)  (U.  S.  Department  of  Labor,  1977)  and  Ghiselli’s 
General  Occupational  Classification  system  (Ghiselli,  1966).  Ten  broad  job 
groups  were  initially  identified  from  a  review  of  these  two  sources.  These 
job  groups  were  then  altered  In  light  of  information  concerning  Important 
distinctions  between  different  military  occupational  groups.  For  example, 
because  mechanical  maintenance  and  electronics  comprise  two  broad,  distinct 
job  types  within  the  Army,  the  DOT  Structural  occupation  category  was 
divided  Into  these  two  job  types.  In  total,  nine  job  types  were  used  to 
classify  the  psychomotor  validity  assessments.  These  job  types  are  listed 
In  Table  6.  Table  6  also  Includes  a  short  list  of  sample  military  and  non- 
military  jobs  within  each  job  type. 

Since  so  many  of  the  subjects  used  In  psychomotor  validity  research 
were  pilots  or  pilot  trainees  (AM  16  validity  coefficients,  or  17.5%  of  all 
tabulated  validities),  each  validity  assessment  was  also  classified  as 
either  pilot  or  non-pilot.  This  classification  represents  a  finer  break¬ 
down  of  the  professional  job  type  from  Table  6. 

The  test  refers  to  the  psychomotor  test  used  in  the  validity  assess¬ 
ment.  As  noted  previously,  75  different  psychomotor  tests  were  used  in  the 
criterion-related  validity  studies  reviewed.  These  tests  are  listed  in  Ta¬ 
ble  7.  Table  7  also  lists  the  psychomotor  ability  tapped  by  "  *.h  test  and 
provides  a  reference  to  an  article  In  which  the  test  has  bee  ,»ed  and/or 
described.  Several  of  the  more  widely  used  tests  listed  In  Table  7  are 
pictured  and  described  In  Appendix  A.  Appendix  A  also  contains  summary 
reliability  and  validity  Information  for  these  tests. 

The  reviewer,  article  review  form  number,  and  predictor  review  form 
number  were  tabulated  In  order  to  Identify  the  article,  report,  or  manual 
from  which  each  validity  assessment  was  obtained. 

Method  of  Summarizing  Validity  Information.  Numerous  tables  summar¬ 
izing  the  validity  evidence  for  each  predictor  construct-criterion  con¬ 
struct  pair  were  prepared  and  are  presented  below.  Within  each  predictor- 
criterion  cell,  several  pieces  of  Information  are  reported.  The  median 
validity  coefficient  appears  as  the  first  entry  in  the  cell.  This  is 
followed  by  the  weighted  mean  validity  coefficient,  which  Is  the  mean 
validity  across  all  coefficients  tabulated  for  that  cell.  In  computing 
this  mean,  the  validity  coefficients  were  weighted  by  sample  size.  For 


Table  5 


Criterion  Constructs 


(2JF 

Criterion 

Construct 

Educational 
and  School 
Achievement 

Grades 

Instructor 

evaluations 

Academic  course  grades  or  GPA 

Instructor  ratings  or 
rankings 

Training 

Performance 

Objective 

measures 

Paper-and-pencil  exam  scores, 
achievement  test  scores,  or 
course  grades  based  solely  on 
paper-and-pencil  exams 

Subjective 

measures 

Instructor  ratings  or 
rankings 

Combination 
objective  and 
subjective 
measures 

Final  course  grades  based  on 
paper-and-pencil  test  scores 
and  Instructor  evaluations 
(Note:  Unless  It  was  specifi¬ 
cally  stated  that  training 
course  grades  were  based  on 
objective  exams  or  subjective 
evaluations,  they  were  cate¬ 
gorized  Into  this  "combina¬ 
tion"  construct) 

Go-no  go 
training 
courses 

Pass/fail,  graduate/non¬ 
graduate,  or  successful/un¬ 
successful  outcomes  or  number 
of  washbacks 

Hands-on 

measures 

Work  sample  or  job  sample 
measures  which  are  scored 
objectively  or  based  on 
instructor  evaluations 

(Continued) 
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Table  5  (Continued) 


Criterion  Constructs 


Major 

Cateaorv 

Criterion 

Construct 

Job  Proficiency 

Subjective 

measures 

Supervisor  or  peer  ratings  or 
rankings 

Job-related 

measures 

Job  knowledge  or  work  sample 
tests 

Archival 

measures 

Units  produced,  salary  rates 
or  increases,  promotions, 
etc. 

Job  Involvement/ 
Withdrawal 

Job 

satisfaction 

Job  satisfaction  or  attitude 
survey  ratings 

Job  withdrawal 

Absenteeism,  re-enlistment, 
or  voluntary  turnover 
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Table  6 


Job  Tvoes  and  Sample  Jobs 


Job 

Tvoe 

Sample 

Military 

Jobs 

Sample 

Non-Military 

Jobs 

Professional , 
Technical,  and 
Managerial 

Air  Force  officers 
Pilots 

Navigators 

Intelligence 

Manners, 

supervisors* 

fOr««en 

Engineers 

Health  care  profes 
sionals  (e.g., 
defttal  hy^enlst) 
Pilots 

Dry*ffcs«*n 

Clerical 

Office  clerk 
Administrator 
Personnel 
specialist 
Communication* 
specialist 

Secretary 

Office  clerk 
Switchboard/ 
keyboard  operator 
Telegrapher 

Sales 

None 

Sales  representative 
Sales  clerk 

Protective 

Services 

Military  pol  ,:e 
Combat  soldier 
Infantryman 

General  enlisted 
personnel 
Undifferentiated 
apprentices 

Police  trainees 
Security  guard 
Correction  officer 

Service 

Food  service 

Medical  specialist 

Food  service 

Medical,  dental 
assistant 

Truck  driver 

Mechanical  and 
Structural 
Maintenance 

Aircraft  mechanic 
Vehicle  mechanic 
Munitions  mechanic 

Machinist 

Mechanic 

Carpenter 

Plumber 

Welder 

Appliance  repairman 

(Continued) 
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Table  6  (Continued) 


Job  Types  and  Sample  Jobs 


Job 

Tvoe 

Sample 

Military 

Jobs 

Sample 

Non-Military 

Jobs 

Electronics 

Electronic  and 
radio  repairman 
Radar  repairman 

Sonar  technician 
Surveillance 
specialist 

Radio  operator 

Electronics 

repairman 

Electrical 

technology 

trainees 

Industrial 

None 

Machine  operator 
Processor, 
assembler, 
bench  worker 
Ironworker 

Coal  miner 

General  maintenance 
worker 

Miscellaneous 

Attack  submarine 
trainee 

Power  plant 
operator 
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example,  a  validity  coefficient  obtained  from  a  sample  of  100  subjects  was 
weighted  twice  as  heavily  as  a  validity  coefficient  obtained  from  a  sample 
of  50  subjects.  The  third  entry  is  the  number  of  different  articles, 
reports,  and  manuals  from  which  validity  coefficients  were  obtained  for 
that  cell  {K).  To  a  certain  extent,  K  represents  the  number  of  independent 
studies  which  contributed  validity  coefficients  to  that  cell.  It  should  be 
recalled,  however,  that  the  majority  of  validity  coefficients  summarized  In 
the  tables  were  taken  from  just  two  sources--the  manual  for  the  GATB  (U.  S. 
Department  of  Labor,  1970)  and  the  AAF  World  War  II  research  on  aircrew 
selection  (Melton,  1947) --and  that  these  sources  both  reported  validity 
results  from  many  semi -Independent  research  efforts.  The  fourth  entry  L 
tells  the  number  of  validity  coefficients  Included  in  that  cell,  while  the 
fifth  entry  H  Indicates  the  number  of  different  psychomotor  tests  used  in 
the  validity  assessments  for  that  ~«11.  Finally,  the  N  range  tells  the 
range  of  sample  sizes  for  the  L  v.  idlty  assessments.  The  number  In  par¬ 
entheses  following  this  range  is  the  median  sample  size  across  all  of  the 
studies  Included  In  that  cell. 

Initially,  summary  tables  were  prepared  for  each  job  type  for  both 
military  and  nonmilitary  subject  populations.  Since  there  were  no  validity 
results  for  five  military  job  types,  a  total  of  13  tables  were  prepared. 

All  of  these  13  tables  are  presented  and  discussed  below. 

In  addition,  there  were  some  Instances  where  It  was  useful  to  combine 
different  subject  populations  (e.g.,  across  research  settings  or  across  job 
types)  or  to  examine  special  subject  populations  (e.g.,  pilots)  In  more 
detail.  Thus,  there  are  many  special  summary  tables  Interspersed  among  the 
13  main  summary  tables  below. 

Finally,  at  the  end  of  the  validity  summary  section  there  Is  a  table 
which  summarizes  the  criterion-related  validity  evidence  by  predictor  con¬ 
struct  and  by  major  criterion  category  (e.g.,  job  proficiency)  for  each  job 
type.  This  table  shows  only  the  median  validity  coefficient  and  the  number 
of  validity  coefficients  (£)  for  each  cell. 

The  Criterion-Related  Validity  Evidence  for  Psvchcfrctor  Tests 

Professional.  Technical,  and  Managerial  Jobs.  Table  8  summarizes  the 
criterion-related  validity  coefficients  for  military  professional,  tech¬ 
nical,  and  managerial  jobs.  The  validity  coefficients  In  this  table  have 
been  taken  almost  exclusively  from  military  research  on  aircrew  selection. 
The  vast  majority  of  these  coefficients  are  for  the  job  of  pilot,  while 
almost  all  of  the  remaining  coefficients  are  derived  from  research  on 
navigators. 

Table  8  Indicates  that  psychomotor  abilities  are  quite  powerful  pre¬ 
dictors  of  training  performance  for  these  jobs.  Multilimb  coordination  and 
control  precision  appear  to  be  particularly  valid  predictors.  Median 
validities  for  multilimb  coordination  are  in  the  .20s  for  two  different 
training  performance  constructs,  while  the  median  validity  of  control 
precision  for  predicting  graduation  from  training  school  is  .22  (W» 20 
validity  coefficients).  In  addition,  several  other  psychomotor  abilities 
show  moderate  validity  for  predicting  graduation  from  training.  The  median 
validity  for  rate  control  Is  .12  (W« 6  validity  coefficients),  the  median 
validity  for  manual  dexterity  Is  .22  (tf«2  validity  coefficients),  and  the 
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Validity  Summary  for  Military  Professional.  Technical,  and  Managerial  Jobs 
(N  =  414  Validity  Coefficients) 


*■** 

44 

li 

• 

<2 
n  « 

Ml 

s 

ii 

3 

s 

§ 

«i 

44 

u 

<: 
n  m 

•4 

44 

m 

VI 

K 

**  M 

m  m 

P  w 
•h  a 
x  • 

U  4 
U  • 
<  £ 

* 

• 

U 

o 

M 

• 

X  2 
1X3 

O  H  ■ 

^  i  i 

1 

n 

3  . 

■H  V 
44  4a 

u  a 

V  M 

*-*  m 
2* 

Training  Performance 

s  z 
i  3 

e : 
22 

4“4 

..  i 

•  •  ”  i 
1 

s?s 

w*  m 
OKU 

I-2 

O  H  ° 

•J 

IN 

vi 

rs  fn»  «r 

OONNHN 
•  •  « 

-ft 
«* 

•Jr 

CM 

« 

nn  cm 

HH  flKiftlN 
•  •  1 

•V 

'C 

CM 

to 

w 

CM 

«  CD  nft 

oon»«» 

•  •  H  H 

i 

n 

O  O  -C 

H  f«  H  H  n  •« 

eft 

a 

« o»  H 

vi  O  Cirh  4  n 

•  •  ft  H 

_ _ 2 

e  • 

o  •  »  • 

^  a  •» 

44  «H  44  4n 

5  «  3 

e  u  e  i  • 

■*  $i  m  —i  m 

tg  i* 

y 

3. 

*4  4* 

u  a 
•  « 

■4"1  ■ 

J* 

OS 

•  M 

a  v 

4t  44 
44  9 
U  • 

*3 

S' 

44 

if 

-  t 
32 

O  U 
^  < 
w 

a  h 

3  8 
St 

M 

si 

44  *4 

S3 

U  9 

44  a* 

a 

c  > 

M  taJ 

V  « 
•  ft 

j  4a  *9 

£ 

f 

i 

e 

« 

f 

f 

: 

K 

44 

■ad 

1  3 

A  K  « 

VI  44  > 

S  3  3 

•f  h  • 

c  «  S  • 

7«  >  • 

U.  ^  6 

lev  < 

«  •  44  u  J  S  « 

ft  X 

•  -H  V  M  I 

£  V  *4 

*  '5 

Alnlnt 

Median  Validity 

Weighted  Mean  Validity 

K 

L 

M 

K 

44 

:  ? 

3  2 

2  ►*  * 
t  44  > 

•  1 

«  a  c 

£23 

,2*  3 

c  V  c 

«  c  v  m 

*  a  44  U  .J  s  *8 

1  ^4  X 

■  “Oft*  X 

*•  V  «M 

<  r  V 

3 

“  ft* 

C  44 

i  s 

>  a-ft 

O  K  4 

S  44  > 

1  a  8 

<  H  S 

41  X  V 

*m  >  44 

O  ftp  c 

CM  ft 

V  M  *4UUS8 

ft  «<  X 

ft  V  V  X 

ft  ft  ed 

fr 

•md 

k- 

O  *4 

44  (-« 

§  « 

X  9  8 

U  MM  « 
ft  b  H  ft 

MOMS  V 

0-  *4  »  « 

u  ^  c 

x  -*  c  v  a 

V  T3  a  44  U  J  JC  « 

«  V  H  X 

CL  4a  o  ft*  X 

E  4,  4  iH 

sr^s 

64 


(Continued) 


Table  8  (continued) 

Sinwmarv  for  Military  Professional  ^Technical,  and  Managerial  Jobs 
(N  =  414  Validity  Coefficients). 
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median  validity  for  aiming  is  .13  (/V- 7  validity  coefficients).  Table  8 
also  shows  that  hands-on  measures  of  training  performance  are  not  well 
predicted  by  psychomotor  test  scores.  The  one  exception  to  this  may  be 
complex  psychomotor  tests  (e.g.,  work  sample  tests),  which  have  a  median 
correlation  of  .40  with  hands-on  training  performance  measures  (/V-8  valid¬ 
ity  coefficients). 

Table  9  contains  a  summary  of  the  validity  evidence  for  pilots  only. 
The  great  majority  of  these  validity  coefficients  were  obtained  from  mili¬ 
tary  research,  but  the  table  does  contain  some  validity  data  for  nonmili¬ 
tary  pilots,  also.  Table  9  provides  clear  evidence  of  the  validity  of 
psychomotor  tests  for  predicting  successful  completion  of  pilot  training. 
Again,  multilimb  coordination  and  control  precision  appear  to  be  the  two 
abilities  most  directly  related  to  pilot  training  performance.  There  is 
some  evidence  that  rate  control  and  aiming  may  be  useful  predictors  as 
well . 


Table  10  contains  a  summary  of  the  validity  evidence  for  nonmilitary 
professional,  technical,  and  managerial  jobs.  The  only  abilities  for  which 
there  is  sufficient  evidence  to  evaluate  the  probable  validity  are  finger 
dexterity,  manual  dexterity,  and  wrist-finger  speed.  Almost  all  of  the 
studies  involving  these  abilities  have  been  conducted  with  the  GATB. 

Table  10  suggests  that  all  three  of  these  abilities  are  moderately 
valid  predictors  of  job  performance  for  this  job  type.  Median  correlations 
between  these  abilities  and  subjective  measures  of  job  proficiency  (e.g., 
supervisory  ratings)  are  all  near  .15.  There  are  not  sufficient  data  to 
evaluate  the  correlations  between  these  abilities  and  training  performance. 
The  data  do  suggest,  however,  that  finger  dexterity  and  manual  dexterity 
are  not  valid  predictors  of  educational  and  school  achievement  for  this  job 
type.  The  validity  data  for  educational  and  school  achievement  are  only 
slightly  more  promising  for  wrist-finger  speed.  For  all  three  abilities, 
the  median  validity  for  educational  and  school  achievement  criteria  is  only 
approximately  .10. 

Table  11  provides  a  summary  of  the  validity  evidence  for  all  profes¬ 
sional,  technical,  and  managerial  jobs  (l.e.,  both  military  and  nonmili¬ 
tary).  Because  the  jobs  represented  in  the  military  validity  summary  table 
for  this  job  type  differ  considerably  from  those  represented  in  the  non- 
military  summary  table,  it  is  difficult  to  draw  any  general  conclusions 
from  Table  11.  It  does  appear,  however,  that  several  psychomotor  abilities 
have  at  least  moderate  validity  for  predicting  training  performance  and  job 
proficiency  for  these  jobs.  This  is  somewhat  surprising,  since  it  is  not 
readily  apparent  how  motor  skills  are  related  to  training  performance  and 
job  proficiency  for  most  of  the  jobs  in  this  job  type.  (The  job  of  pilot 
is,  of  course,  an  exception.)  One  possible  explanation  may  be  that  these 
validities  are  unrelated  to  psychomotor  ability.  In  the  next  section, 
there  will  be  a  discussion  of  the  correlations  between  psychomotor  ability 
measures  and  cognitive  ability  measures.  Almost  all  of  these  correlations 
are  low  to  moderate  and  positive.  Thus,  for  this  particular  job  type,  it 
may  be  that  the  low  positive  correlations  between  psychomotor  abilities  and 
criterion  measures  of  training  performance  and  job  proficiency  are  attrib¬ 
utable  to  cognitive  ability  variance  embedded  in  the  psychomotor  measures. 
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Table  9 

Validity  Summary  for  All  Pilots  (N  °  414  Validity  Coefficients! 
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Table  9  (continued) 

Validity  Summary  for  All  Pilots  (N  =  414  Validity  Coefficients! 
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793  Validity  Coefficients 
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Hedian  Validity 
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Median  Validity 
Weighted  Mean  Validity 


Validity  Sumwarv  for  Nonmilitarv  Clerical  Jobs  (N  -  143  Validity  Coefficients! 
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Clerical  Jobs.  No  validity  data  were  identified  for  clerical  jobs  in 
the  military.  Therefore  Table  12,  which  summarizes  the  validity  data  for 
nonmilitary  clerical  jobs,  Is  the  only  summary  table  for  this  job  type. 

Validity  data  are  only  available  for  three  abilities  for  this  job 
type:  finger  dexterity,  manual  dexterity,  and  wrist-finger  speed.  In  fact, 
all  of  the  validity  data  summarized  in  Table  12  are  from  studies  involving 
the  GATB  psychomotor  scales. 

All  three  abilities  appear  to  be  relatively  valid  predictors  for 
nonmilitary  clerical  jobs.  For  example,  most  of  the  median  correlations 
between  these  abilities  and  job  proficiency  criteria  fall  between  .10 
and  .20.  Validity  coefficients  for  educational  and  school  achievement 
criteria  are  somewhat  higher.  All  but  one  of  the  median  validities  In 
these  six  predictor-criterion  cells  are  between  .25  and  .38.  Unfortu¬ 
nately,  there  are  not  sufficient  data  to  evaluate  the  criterion-related 
validity  of  these  abilities  for  training  performance  or  job  Involvement/ 
withdrawal  criteria. 

Information  in  Table  12  also  reveals  that  the  validities  for  wrist- 
finger  speed  tend  to  be  slightly  higher  than  the  validities  for  finger  and 
manual  dexterity  across  most  of  the  criterion  constructs  for  which  validity 
data  are  available.  While  the  validity  differences  between  wrist-finger 
speed  and  the  other  two  abilities  are  not  overwhelming,  they  do  suggest 
that  wrist-finger  speed  might  be  the  first  choice  for  a  psychomotor  pre¬ 
dictor  for  this  job  type. 

Sales  Jobs.  Since  there  are  no  data  for  sales  jobs  In  the  military 
(see  Table  6),  validity  data  were  summarized  for  nonmilitary  sales  jobs 
only.  Table  13  contains  this  validit;  summary. 

In  total,  only  15  validity  coefficients  were  Identified  for  this  job 
type.  These  validity  coefficients  were  obtained  from  five  studies  in  which 
the  GATB  was  used  to  predict  supervisory  ratings  of  job  proficiency.  As 
Table  13  shows,  the  median  validity  coefficients  across  these  five  studies 
are  very  small  for  all  three  psychomotor  abilities  assessed  by  the  GATB. 

The  median  validity  for  wrist-finger  speed  is  only  .14,  and  the  median 
validities  for  finger  dexterity  and  manual  dexterity  are  both  only  .08. 

These  data  indicate  that  psychomotor  abilities  add  little  to  the 
prediction  of  job  proficiency  for  sales  jobs.  It  must  be  recalled,  of 
course,  that  this  conclusion  Is  based  on  very  limited  data.  Nevertheless, 
It  seems  unlikely  that  many  sales  positions  would  require  extensive  psycho- 
motor  skills.  Indeed,  the  lack  of  psychomotor  validity  data  probably 
reflects  the  belief  of  many  psychologists  that  psychomotor  predictors  do 
not  warrant  Investigation  for  this  job  type.  The  sketchy  data  available 
from  the  GATB  seem  to  support  this  belief. 

Protective  Service  Jobs.  Table  14  summarizes  the  validity  data  for 
military  protective  service  jobs.  Most  of  the  validity  coefficients  In 
this  table  are  the  result  of  the  AAF’s  World  War  II  aircrew  selection 
research  on  bombardiers  (Melton,  1947).  The  validity  coefficients  for 
complex  psychomotor  predictors  were  taken  from  recent  research  Into  the 
selection  of  tank  crewmen  (Campbell  &  Black,  1982;  Eaton  et  al.,  1980). 
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Validity  Summary  for  Nonmilitarv  Sales  Jobs  (N  «  15  Validity  Coefficients! 
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Table  13  (continued) 

Validity  Summary  for  Nonmilitarv  Sales  Jobs  (N  -  15  Validity  Coefficients! 
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Table  14 

Validity  Summary  for  Military  Protective  Service  Jobs  (N_  =  118  Validity  Coefficients} 
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Table  14  (continued) 

Validity  Summary  for  Military  Protective  Service  Jobs  (N  -  118  Validity  Coefficients! 
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Table  14  shows  that  multi! Imb  coordination,  control  precision,  and 
finger  dexterity  measures  are  all  moderately  valid  predictors  of  graduation 
from  bombardier  training.  Median  r's  range  from  .12  to  .20.  Validity 
coefficients  were  somewhat  lower  when  hands-on  training  measures  were  used 
as  criteria.  For  bombardiers,  the  most  common  hands-on  measure  was  the 
distance  between  the  bomb  target  and  the  actual  spot  where  the  bombs  were 
dropped.  The  median  correlation  between  hands-on  training  measures  and 
measures  of  multilimb  coordination  was  .23  (/V- 14  validity  coefficients), 
while  median  validities  for  rate  control,  finger  dexterity,  and  arm-hand 
steadiness  measures  using  this  same  criterion  were  .05  (AM  validity  coef¬ 
ficients),  .06  (/V- 5  validity  coefficients),  and  .06  (A/-3  validity  coef¬ 
ficients),  respectively. 

Thirty-eight  validity  coefficients  were  tabulated  from  research  on  the 
prediction  of  tank  crew  training  performance  using  complex  psychomotor 
predictors.  The  median  value  of  these  38  validity  coefficients  Is 
only  .07.  The  validities  vary  greatly  from  test  to  test  and  from  criterion 
measure  to  criterion  measure,  however.  For  example,  the  Gunner  Tracking 
Task  used  by  Campbell  and  Black  (1982)  had  moderately  high  predictive 
validity  (median  r-.17,  Af-2  validity  coefficients).  Additional  research 
may  show  that  certain  complex  psychomotor  predictors  are  quite  useful  for 
predicting  success  In  military  protective  service  jobs. 

Table  15  contains  a  summary  of  the  validity  data  for  nonmilitary  pro¬ 
tective  service  jobs.  Again,  very  few  validity  coefficients  are  available. 
Only  five  criterion-related  validity  studies  were  Identified  for  this  job 
family- -all  Involving  the  GATB. 

Table  15  Indicates  that  wrist-finger  speed  Is  a  moderately  valid  pre¬ 
dictor  of  subjective  measures  of  job  proficiency  (median  r-.23,  A/-5  validi¬ 
ty  coefficients).  Median  validities  are  somewhat  lower  for  finger  and 
manual  dexterity  (r-,10  and  .15,  respectively,  /V-5  validity  coefficients 
each).  While  there  are  too  few  coefficients  to  draw  any  definitive  con¬ 
clusions,  the  data  do  suggest  that  all  three  of  these  psychomotor  abilities 
may  have  moderate  validity  for  predicting  supervisory  job  proficiency 
ratings. 

Table  16  provides  a  complete  summary  of  the  validity  data  for  this  job 
type  (l.e.,  combining  military  and  nonmilitary  research).  Based  on  this 
table,  It  appears  that  multilimb  coordination  and  wrist-finger  speed  are 
the  two  psychomotor  abilities  with  the  highest  criterion-related  validities 
for  protective  service  jobs.  Median  validities  for  these  two  abilities 
across  all  criteria  are  near  .20.  In  addition,  control  precision,  finger 
dexterity,  and  manual  dexterity  appear  to  have  moderate  criterion-related 
validity  (l.e.,  .10<r<.20)  for  this  job  type. 

In  sum,  even  though  relatively  few  criterlor-related  validity  coeffi¬ 
cients  were  Identified  for  protective  service  jobs,  those  data  which  have 
been  collected  suggest  that  psychomotor  abilities  may  indeed  be  useful  for 
predicting  training  performance  and  job  proficiency  for  these  jobs. 

Service  Jobs.  Tables  17  and  18  contain  summaries  of  the  criterion- 
related  validity  evidence  for  military  and  nonmilitary  service  jobs,  re¬ 
spectively. 
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Table  15 

Validity  Summary  for  Noninllitarv  Protective  Service  Jobs  (N  =  15  Validity  Coefficients! 
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Validity  Summary  for  Honiailitarv  Protective  Service  Jobs  fH  =  15  Validity  Coefficients! 
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Table  16 

Valid itv  Summary  for  All  Protective  Service  Jobs  (N  ~  133  Validity  Coefficients 
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Table  16  (continued) 
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Table  17 

Validity  summary  for  Military  Service  Jobs  (N  =  8  Validity  Coefficients! 
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TaLle  17  (continued) 

Valinltv  Summary  for  Military  Service  Jobs  (N  =  8  Validity  Coefficients) 
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Validity  Summary  for  Nonmilltarv  Service  Jobs  (N  -  162  Validity  Coefficients! 
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Table  18  (continued) 

Validity  summary  for  Noniwilitarv  Service  Jobs  (N  »  162  Validity  Coefficients! 
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In  total,  only  eight  validity  coefficients  were  identified  for  mili¬ 
tary  service  jobs.  This  is  insufficient  to  draw  any  conclusions  regarding 
the  criterion-related  validity  of  psychomotor  tests  for  these  jobs. 

The  validity  coefficients  in  Table  18  were  derived  from  two  sources. 
The  first  was  a  concurrent  validity  study  of  Washington,  OC  taxicab  drivers 
reported  by  Farr,  O’Leary,  Pfeiffer,  Goldstein,  and  Bartlett  (1971).  All 
of  the  validities  for  job-related  and  archival  measures  of  job  proficiency 
were  taken  from  this  study.  The  remaining  validity  coefficients  were  based 
on  research  with  the  GATB  psychomotor  scales. 

Median  validities  from  the  taxicab  study  are  all  quite  low,  ranging 
from  -.16  to  .10.  These  data  suggest  that  multilimb  coordination,  rate 
control,  and  complex  psychomotor  predictors  (In  this  case,  driving  simula¬ 
tors)  are  unrelated  to  job  proficiency  criteria.  It  should  be  noted, 
however,  that  the  reliability  of  job-related  and  archival  measures  of  job 
proficiency  tends  to  be  fairly  low.  An  examination  of  all  of  the  validity 
summary  tables  which  were  prepared  for  this  section  indicated  that  valid¬ 
ities  tended  to  be  lower  for  these  two  criteria  than  they  were  for  other 
educational  and  school  achievement,  training  performance,  or  job  profi¬ 
ciency  criteria.  Thus,  while  the  validities  for  taxicab  drivers  in  Table 
18  are  certainly  not  encouraging,  It  may  be  that  the  low  correlations 
reflect  problems  in  the  selection  and  measurement  of  the  criterion  rather 
than  a  lack  of  criterion-related  validity. 

In  contrast,  validity  coefficients  based  on  GATB  research  Indicate 
that  finger  dexterity,  manual  dexterity,  and  wrist-finger  speed  are  all 
valid  predictors  of  several  different  criteria.  For  example,  median 
validities  for  Instructor  evaluations  range  from  .25  to  .42.  Median  abil¬ 
ity-correlations  with  subjective  measures  of  training  performance  range 
from  .29  to  .50.  Finally,  the  criterion-related  validity  for  subjective 
measures  of  job  proficiency  ranges  from  .13  to  .20  for  these  three  abil¬ 
ities. 

In  general,  comparing  across  criteria,  there  appears  to  be  little  dif¬ 
ference  In  the  criterion-related  validity  of  these  three  abilities.  Com¬ 
paring  across  abilities,  subjective  criterion  measures  for  several  dif¬ 
ferent  criterion  categories  (e.g.,  instructor  evaluations  of  educational 
and  school  achievement,  subjective  measures  of  training  performance,  and 
subjective  measures  of  job  proficiency)  appear  to  be  predictable  from 
psychomotor  ability  tests.  Thus,  for  this  job  type,  psychomotor  abilities 
may  be  useful  for  predicting  both  training  and  job  proficiency  criteria. 

Table  19  summarizes  the  validity  data  for  all  service  jobs.  Since 
military  research  has  contributed  very  little  validity  data  for  this  job 
type,  combining  military  and  nonmilitary  validity  data  did  not  result  in 
any  additional  conclusions  beyond  those  already  described  above  for  non¬ 
military  service  jobs. 

Mechanical  and  Structural  Maintenance  Jobs.  No  military  criterion- 
related  validity  research  was  located  for  mechanical  and  structural  mainte¬ 
nance  jobs.  Therefore,  the  validity  summary  for  nonmilitary  mechanical 
and  structural  jobs  in  Table  20  contains  all  the  validity  coefficients 
which  were  Identified  for  this  job  type. 
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validity  Summary  for  Nonmilitarv  Mechanical  and  Structural  Maintenance  Jobs 
fN  *  184  Validity  Coefficients) 
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Table  20  (continued) 

Validity  summary  for  Nonmilitary  Mechanical  and  Structural  Maintenance  Jobs 
fN  =  184  Validity  Coefficients) 
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Weighted  Mean  Validity 


The  vast  majority  of  the  validity  data  summarized  In  Table  20  are 
based  on  research  with  the  GATB  psychomotor  scales.  Several  other  psycho¬ 
motor  tests  and  studies  are  also  represented  In  this  table,  however. 

Sufficient  validity  data  are  available  to  evaluate  the  criterion- 
related  validity  of  only  three  psychomotor  abilities:  finger  dexterity, 
manual  dexterity,  and  wrist-finger  speed.  All  three  abilities  appear  to 
have  moderate  validity  for  predicting  educational  and  school  achievement. 
Median  validities  for  these  three  abilities  across  the  two  educational  and 
school  achievement  criteria  range  from  .16  to  .20.  The  few  validities 
available  for  training  criteria  suggest  that  training  performance  may  be 
even  more  predictable  than  educational  and  school  achievement  for  this  job 
type.  Median  validities  for  finger  dexterity  and  manual  dexterity  range 
from  .18  to  .28.  Correlations  between  wrist-finger  speed  and  training 
performance  are  somewhat  lower,  however  (median  r-.06,  /Y-3  validity  coef¬ 
ficients).  Finally,  there  Is  little  difference  In  the  median  validities  of 
these  three  abilities  for  subjective  measures  of  job  proficiency.  Median 
correlations  are  .21,  .21,  and  .14  for  finger  dexterity,  manual  dexterity, 
and  wrist-finger  speed,  respectively  (/V- 39,  40,  and  38  validity  coeffi¬ 
cients,  respectively). 

In  summary,  criterion-related  validity  research  shews  that  finger  dex¬ 
terity,  manual  dexterity,  and  wrist-finger  speed  are  all  equally  valid 
predictors  of  educational  and  school  achievement  and  subjective  measures  of 
job  proficiency  for  mechanical  and  structural  maintenance  jobs.  Median 
validity  coefficients  for  all  three  abilities  for  these  criteria  tend  to 
range  from  .15  to  .20.  The  validity  coefficients  available  for  training 
performance  and  job  proficiency  suggest  that  manual  dexterity  and  finger 
dexterity  might  be  more  valid  predictors  than  wrist-finger  speed. 

Electronics  Jobs.  Table  21  summarizes  the  few  validity  coefficients 
Identified  for  electronics  jobs.  All  of  the  psychomotor  criterion-related 
validity  research  Identified  for  this  job  type  was  conducted  with  non- 
military  subjects.  The  only  tests  used  in  this  research  were  the  GATB 
Finger  Dexterity,  Manual  Dexterity,  and  Motor  Coordination  scales.  In 
total,  only  30  validity  coefficients  were  located,  10  for  each  of  the  three 
GATB  scales. 


The  validity  data  which  do  exist  suggest  that  finger  dexterity  (median 
r-.17,  tf-3  validity  coefficients)  and  wrist-finger  speed  (median  r-.12,  A/-3 
validity  coefficients)  are  moderately  valid  predictors  of  school  grades  for 
this  job  type.  Median  validities  for  subjective  measures  of  job  proficien¬ 
cy  are  comparable  for  the  three  abilities.  The  median  validity  for  finger 
dexterity  Is  .15,  the  median  validity  for  manual  dexterity  Is  .12,  and  the 
median  validity  for  wrist-finger  speed  is  .16  (N*7  validity  coefficients 
each). 


Based  on  the  validity  data  summarized  In  Table  21, 
determine  If  any  of  the  four  major  criterion  categories 
able  than  others.  Table  21  does  suggest,  however,  that 
related  validity  for  manual  dexterity  is  lower  than  for 
wrist-finger  speed. 


it  is  difficult  to 
are  more  predict- 
the  criterion- 
finger  dexterity  or 
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Table  21 

Validity  Summary  for  Nonmllitarv  Electronics  Jobs  (N  =  30  Validity.  Coeff IcientaJ. 
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(Continued) 


Table  21  (continued) 

Validity  Summary  for  Nonwilitarv  Electronics  Jobs  (N  =  30  Validity  Coefficients), 
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Industrial  Jobs.  All  of  the  validity  coefficients  for  industrial  jobs 
are  summarized  In  Table  22.  Since  there  are  no  data  on  Industrial  jobs  in 
the  military  (see  Table  6),  Table  22  includes  only  nonmilitary  research. 

As  with  most  of  the  other  nonmilitary  validity  summary  tables  described 
thus  far,  Table  22  is  dominated  by  validity  coefficients  obtained  from  6ATB 
research.  For  example,  except  for  two  studies  Involving  complex  psycho¬ 
motor  predictors,  the  only  predictors  used  In  the  validity  research  sum¬ 
marized  In  Table  22  have  been  measures  of  finger  dexterity,  manual  dexteri¬ 
ty,  or  wrist-finger  speed.  Some  of  the  median  validities  reported  in 
Table  22,  however,  are  based  on  research  with  a  number  of  different  finger 
dexterity  or  manual  dexterity  tests  (see  the  H  entry  in  the  summary  table). 
This  should  mean  that  the  conclusions  below  are  general  to  validity  results 
which  would  be  obtained  across  a  wide  variety  of  tests  of  finger  and  manual 
dexterity. 

Results  for  educational  and  school  achievement  indicate  that  both 
finger  and  manual  dexterity  are  valid  predictors.  Median  validities  for 
these  two  abilities  across  the  two  educational  and  school  achievement 
criteria  range  from  .22  to  .31.  The  median  validities  for  wrist-finger 
speed  are  somewhat  lower,  however  (r-.08  with  school  grades  and  .18  with 
instructor  evaluations,  //- 4  validity  coefficients  each). 

Results  are  very  similar  for  subjective  measures  of  training  perfor¬ 
mance.  Both  finger  dexterity  and  manual  dexterity  possess  moderately  high 
criterion-related  validity,  while  the  criterion-related  validity  of  wrist- 
finger  speed  is  somewhat  lower.  In  studies  involving  hands-on  measures  of 
training  performance,  however,  neither  finger  nor  manual  dexterity  was 
significantly  related  to  the  criterion  measure. 

Validity  results  varied  significantly  across  the  three  job  proficiency 
criteria.  Median  validities  for  all  three  abilities  for  subjective  mea¬ 
sures  of  job  proficiency  were  near  .25.  Median  validities  for  archival  job 
proficiency  measures  were  slightly  lower,  ranging  from  .17  to  .21.  Median 
validities  for  job-related  measures  of  job  proficiency  (i.e.,  hands-on  job 
performance  tests)  were  substantially  lower  than  the  median  validities  for 
the  other  two  job  proficiency  criteria,  ranging  from  .00  for  finger  dex¬ 
terity  to  .14  for  manual  dexterity  and  wrist-finger  speed  (/V- 1  validity 
coefficient  each).  As  noted  previously  in  the  validity  summary  for  service 
jobs,  throughout  the  validity  summary  tables  median  validities  for  job- 
related  and  archival  measures  of  job  proficiency  tend  to  be  lower  than  the 
median  validities  for  other  criteria.  This  suggests  that  there  may  be 
reliability  problems  associated  with  the  measurement  of  these  two  job 
proficiency  criteria. 

Finally,  Table  22  suggests  that  job  withdrawal  (i.e.,  turnover  and 
absenteeism)  is  not  readily  predictable  from  manual  dexterity  for  this  job 
type  (median  r-.02,  tf-3  validity  coefficients).  The  only  validity  coeffi¬ 
cient  identified  for  job  withdrawal  and  finger  dexterity  is  somewhat  higher 
(r» . 14) ,  suggesting  that  there  may  be  some  value  to  investigating  this 
psychomotor  ability  as  a  predictor  of  job  withdrawal  for  industrial  jobs. 

A  comparison  of  the  criterion-related  validity  of  finger  dexterity, 
manual  dexterity,  and  wrist-finger  speed  indicates  that  the  median 
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Table  22 

Validity  Summary  for  Nonmilitarv  Industrial  Jobs  (N  °  814  Validity  Coefficients! 
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Table  22  (continued) 
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Cowplex  Fsychoaotor 
Predictors 

Median  Validity  .14 

Weighted  Mean  Validity  .14 


validities  for  wrist-finger  speed  are  somewhat  lower  than  those  for  finger 
dexterity  and  manual  dexterity.  This  Is  especially  evident  for  educational 
and  school  achievement  and  training  performance  criteria.  In  addition,  for 
these  two  criterion  categories,  the  criterion-related  validity  of  finger 
dexterity  tends  to  be  slightly  higher  than  the  criterion-related  validity 
of  manual  dexterity.  For  job  proficiency  criteria,  the  median  criterion- 
related  validities  for  all  three  abilities  are  virtually  identical. 

Thus,  the  validity  coefficients  summarized  in  Table  22  once  again  il¬ 
lustrate  the  criterion -related  validity  of  psychomotor  abilities  for  pre¬ 
dicting  educational  and  school  achievement,  training  performance,  and  job 
proficiency.  For  two  of  these  criterion  categor1es--educational  and  school 
achievement  and  training  performance- -the  criterion-related  validities  for 
finger  dexterity  and  manual  dexterity  are  higher  than  those  for  wrist- 
finger  speed.  Of  special  Interest  In  Table  22  is  one  study  which  indicates 
that  finger  dexterity  might  be  a  valid  predictor  of  job  withdrawal.  More¬ 
over,  when  there  are  differences  in  the  median  validities  for  finger  dex¬ 
terity  and  manual  dexterity,  those  for  finger  dexterity  tend  to  be  slightly 
higher.  These  findings  suggest  that  finger  dexterity  might  be  the  single 
best  psychomotor  ability  for  predicting  the  various  criteria  for  this  job 
type. 


Miscellaneous  Jobs.  Table  23  summarizes  the  criterion-related  valid¬ 
ity  data  for  military  miscellaneous  jobs.  Only  seven  validity  coefficients 
were  identified  for  this  table.  This  Is  insufficient  to  draw  any  general 
conclusions  about  the  validity  of  psychomotor  tests  for  these  jobs. 

Table  24  contains  a  summary  of  the  validity  evidence  for  nonmilitary 
miscellaneous  jobs.  Jobs  classified  into  this  job  type  Included  air  traf¬ 
fic  controller,  college  student,  dairy  farm  hand,  lemon  picker,  order 
filler,  and  power  plant  operator. 

Virtually  all  of  the  84  validity  coefficients  summarized  In  Table  24 
were  taken  from  GATB  research.  Only  one  of  the  validity  coefficients 
Included  in  this  table  was  not  based  on  one  of  the  GATB  psychomotor  scales. 
Thus,  Table  24  summarizes  the  criterion-related  validity  evidence  for  a 
wide  variety  of  jobs  using  only  a  very  few  psychomotor  tests. 

These  problems--especially  the  problems  associated  with  the  wide  va¬ 
riety  of  jobs--make  It  difficult  to  Interpret  the  median  validities  shown 
In  the  table.  For  example,  the  median  correlation  between  manual  dexterity 
and  school  grades  Is  -.02,  but  the  median  correlation  between  manual  dex¬ 
terity  and  subjective  measures  of  job  proficiency  is  .24.  How  are  these 
results  to  be  reconciled?  One  plausible  explanation  Is  that  the  jobs  for 
which  school  grades  were  used  as  a  criterion  differ  substantially  from 
those  for  which  subjective  job  proficiency  measures  were  used  as  a  cri¬ 
terion.  In  this  case,  that  may  Indeed  be  a  correct  explanation.  For  the 
18  validity  observations  for  which  school  grades  were  used  as  a  criterion, 
the  "job"  was  most  often  college  student  with  major  field  of  study  unspeci¬ 
fied.  In  contrast,  one  of  the  jobs  for  which  subjective  measures  of  job 
proficiency  was  used  as  the  criterion  was  lemon  picker.  Since  the  psycho¬ 
motor  abilities  relevant  to  success  in  these  two  "jobs"  seem  to  overlap 
little  if  at  all,  it  Is  not  surprising  that  there  is  such  a  large  differ¬ 
ence  between  the  median  validities  in  these  two  cells  of  the  table. 
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Table  23  (continued) 

Validity  Summary  for  Military  Miscellaneous  Jobs  (N  -  7  Validity  Coefficients! 
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Table  24 

Validity  Summary  for  Nonmilitarv  Miscellaneous  Jobs  (N  =  84  Validity  Coefficients! 
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Table  24  (continued) 
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In  general,  In  spite  of  these  problems,  the  median  validities  reported 
In  Table  24  are  moderate  to  high.  Excluding  the  manual  dexterity-school 
grades  cell  and  all  predictor-criterion  cells  with  only  one  validity  ob¬ 
servation,  median  validities  range  from  .16  to  .24.  Median  validities  are 
slightly  higher  for  wrist-finger  speed  than  for  finger  or  manual  dexterity. 

In  summary,  Table  24  demonstrates  that  psychomotor  abilities  are 
related  to  educational  and  job  proficiency  criteria  for  a  number  of  miscel¬ 
laneous  jobs.  Because  of  the  range  of  jobs  included  in  Table  24,  however, 
it  is  difficult  to  draw  any  specific  conclusions  about  the  criterion- 
related  validity  of  any  particular  psychomotor  ability  for  any  specific 
class  of  miscellaneous  jobs.  Thus,  the  conclusions  which  can  be  drawn 
based  on  this  cable  are  quite  limited. 

Table  25  contains  a  summary  of  the  validity  data  for  all  miscellaneous 
jobs  (l.e.,  both  military  and  nonmilitary).  Since  so  few  validity  coeffi¬ 
cients  were  Identified  for  military  jobs,  the  conclusions  which  can  be 
drawn  from  Table  25  do  not  differ  from  those  summarized  above  for  Table  24.. 

Validity  Results  bv  Research  Setting.  Tables  26  and  27  provide  over¬ 
all  summaries  of  the  validity  data  for  military  and  nonmilitary  research, 
respectively. 

Table  26  shows  that  most  psychomotor  research  In  the  military  has 
focused  on  coordination  and  tracking  abilities,  such  as  multilimb  coordina¬ 
tion,  control  precision,  rate  control,  aiming,  and  arm-hand  steadiness.  By 
contrast,  there  has  been  relatively  little  research  into  the  criterion- 
related  validity  of  dexterity  abilities  such  as  manual  dexterity  and  wrist- 
finger  speed.  There  have  been  a  number  of  studies,  however,  investigating 
the  criterion-related  validity  of  finger  dexterity. 

The  validity  results  for  military  research  show  that  median  validities 
for  multilimb  coordination  tend  to  be  higher  than  those  for  any  other 
psychomotor  ability.  These  median  validities  average  more  than  .20  across 
several  different  training  performance  criteria.  Other  abilities  with  mod¬ 
erate  to  high  median  validities  include  control  precision  (median  r«.19 
with  graduation  from  training,  /V-27  validity  coefficients),  aiming  (median 
r-.13  with  graduation  from  training,  Af-7  validity  coefficients),  and  rate 
control  (median  r-.12  with  graduation  from  training,  /Y-6  validity  coef¬ 
ficients).  Median  validities  for  finger  dexterity,  wrist-finger  speed,  and 
arm-hand  steadiness  average  .10  or  less  across  criteria,  indicating  that 
these  abilities  have  little  or  no  criterion-related  validity  for  the  mili¬ 
tary  jobs  studied.  Validity  results  for  studies  using  complex  psychomotor 
predictors  vary  greatly.  Median  validities  are  .15  for  graduation  from 
training  (W«47  validity  coefficients)  and  .10  for  hands-on  training  mea¬ 
sures  (TV-46  validity  coefficients),  indicating  that  complex  psychomotor 
predictors  were  only  moderately  valid  on  the  average.  There  has  been 
insufficient  research  to  draw  any  conclusions  regarding  the  criterion- 
related  validity  of  manual  dexterity  and  speed  of  arm  movement. 

The  validity  results  summarized  in  Table  26  are  based  on  a  very  unrep¬ 
resentative  sample  of  military  jobs.  The  vast  majority  of  these  validity 
coefficients  are  based  on  research  with  pilots  (/V-396  validity  coeffi¬ 
cients,  or  72.4%).  None  of  the  studies  reviewed  for  Table  26  involved 
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Table  26  (continued) 

Validity  Summary  for  All  Military  Jobs _ (N  a_5_47  Validity,, Coefficients! 
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Median  Validity  .07  .21  .21  .26  .2*  .20  .07  .22 

Weighted  Mean  Validity  .06  .20  .21  .28  .22  I  .20  .08  .20 


Validity  Summary  for  All  Noniwilitarv  Jobs  (N  -  1826  Validity  Coefficients) 
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Infantrymen,  mechanical  or  electronics  repair  specialists,  or  clerical  or 
administrative  specialists.  Thus,  these  validity  results  may  have  little 
or  no  relevance  for  a  number  of  large  job  families  within  the  Army.  Never¬ 
theless,  the  many  moderate  to  high  validities  in  Table  26  demonstrate  that 
psychomotor  abilities  have  been  valid  predictors  for  the  military  In  the 
past  and  should  be  carefully  considered  for  Inclusion  In  new  selection  and 
classification  batteries. 

Table  27  shows  that  virtually  all  nonmllltary  psychomotor  validity  re¬ 
search  has  focused  on  determining  the  criterion-related  validity  of  finger 
dexterity,  manual  dexterity,  and  wrist-finger  speed.  Of  course,  these  are 
the  three  psychomotor  abilities  assessed  by  the  GATB.  Indeed,  of  the  1,826 
validity  coefficients  tabulated  for  Table  27,  1,734  (95.0%)  are  from  the 
GATB. 


The  median  validities  reported  In  Table  27  Illustrate  that  finger  dex¬ 
terity,  manual  dexterity,  and  wrist-finger  speed  are  valid  predictors  of  a 
number  of  criteria.  For  example,  all  three  abilities  have  demonstrated 
validity  as  predictors  of  various  subjective  criteria  (e.g,  Instructor 
evaluations  of  educational  and  school  achievement,  subjective  measures  of 
training  performance,  and  subjective  measures  of  job  proficiency),  with 
most  median  validities  for  these  criteria  ranging  from  .20  to  .30.  Median 
correlations  between  these  three  abilities  and  job-related  and  archival 
measures  of  job  proficiency  are  also  quite  high,  averaging  near  .20.  Me¬ 
dian  validities  for  school  grades  are  somewhat  lower,  ranging  from  .07 
to  .13. 

With  the  exception  of  multilimb  coordination,  very  little  Information 
Is  available  regarding  the  criterion-related  validity  of  any  other  psycho¬ 
motor  abilities  In  nonmilitary  research  settings.  Moreover,  the  validity 
summary  for  multilimb  coordination  Is  based  on  results  from  just  two 
studies,  one  Involving  cab  drivers  and  the  other  Involving  civilian  pilots. 
Two  different  criterion  measures  were  used  In  each  study.  Validities  were 
quite  low  across  all  four  criteria,  ranging  from  -.16  to  .10. 

Comparison  of  Tables  26  and  27  Is  Interesting  primarily  because  it  il¬ 
lustrates  the  differences  in  focus  between  military  and  nonmilitary  psy¬ 
chomotor  validity  research.  As  noted  previously,  the  military  has  Invested 
most  of  Its  research  effort  In  coordination  and  tracking  abilities,  while 
most  nonmilitary  psychomotor  research  has  been  concentrated  on  dexterity 
and  manipulation  abilities. 

Since  planes  and  pilots  probably  represent  the  military’s  greatest 
single  Investments  In  hardware  and  human  resources,  respectively,  it  Is 
easy  to  understand  why  the  military  has  directed  most  of  its  psychomotor 
validity  research  towards  improving  pilot  and  aircrew  selection.  Small 
Improvements  In  these  selection  procedures  can  result  In  great  savings  in 
equipment  and  training  costs.  The  job  of  pilot,  however,  Is  probably  one 
of  the  few  occupations  requiring  high  levels  of  coordination  and  precise 
tracking  and  control  adjustment  skills. 

In  contrast,  most  nonmilitary  research  has  been  directed  towards 
assessing  the  criterion-related  validity  gf  dexterity  abilities,  such  as 
finger  dexterity,  manual  dexterity,  and  wrist-finger  speed.  These  are  the 
psychomotor  abilities  which  seem  to  be  most  relevant  to  clerical,  Indus- 
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trial,  and  maintenance  and  repair  jobs.  Typically,  these  jobs  do  not 
require  the  precise  coordination  of  limb  movements  which  the  job  of  pilot 
requires.  Nor  do  they  require  continuous  adjustment  to  changing  stimulus 
conditions.  Instead,  most  of  these  jobs  seem  to  demand  moderately  precise, 
moderately  rapid,  repetitive  object  manipulations.  The  specific  manipula¬ 
tions  required  are  often  affected  little  by  events  in  the  environment,  so 
there  Is  no  need  to  monitor  and  adjust  motor  responses  In  accordance  with 
these  environmental  changes.  Therefore,  It  is  not  surprising  that  nonmili¬ 
tary  research  has  focused  on  validating  tests  of  these  dexterity  abilities. 

The  validity  results  presented  In  Tables  26  and  27  lend  support  to 
these  observations.  For  example,  median  validities  for  multilimb  coordina¬ 
tion  for  military  studies  average  .20,  while  those  for  nonmllitary  studies 
average  .00.  Validity  results  are  exactly  opposite  for  finger  dexterity. 
Median  validities  for  nonmilitary  studies  average  .20,  while  those  for 
military  studies  average  less  than  .10.  These  median  validities  point  to 
differences  In  the  psychomotor  demands  of  the  military  (i.e.,  pilots)  and 
nonmilitary  (I.e.,  clerical,  Industrial,  and  maintenance  and  repair)  jobs 
studied. 

Validity  Summary  for  All  Jobs.  Table  28  provides  a  summary  of  all  of 
the  validity  coefficients  Identified  during  the  literature  review.  The 
table  contains  very  little  new  Information,  but  It  does  provide  a  concise 
overview  of  the  criterion-related  validity  results  reported  In  the  litera¬ 
ture.  Generally,  these  results  Indicate  that  psychomotor  abilities  are 
valid  predictors  of  educational  and  school  achievement,  training  perfor¬ 
mance,  and  job  proficiency.  Indeed,  of  the  10  abilities  summarized  In 
Table  28,  only  arm-hand  steadiness  fails  to  attain  moderate  validity  (I.e., 
median  r->.10)  for  at  least  one  of  the  12  criterion  constructs.  Additional 
analysis  of  this  table  reveals  that,  of  the  33  predictor-criterion  cells 
containing  at  least  five  validity  observations  (I.e.,  L->5),  median  va¬ 
lidities  range  from  .20  to  .29  In  nine  cells  (27.3%),  and  from  .10  to  .19 
In  18  cells  (54.5%).  Median  validities  are  less  than  .10  In  only  six  cells 
(18.2%). 

Comarison  of  Criterion-Related  Validities  by  Job  Type  and  Criterion 
Category.  Table  29  provides  a  summary  of  most  of  the  validity  evidence 
presented  previously  in  Tables  8-28.  The  table  was  designed  to  aid  in  the 
comparison  of  validities  across  job  types  and  criterion  categories  for  each 
of  the  ten  psychomotor  ability  constructs. 

The  validity  data  suggest  that  different  psychomotor  abilities  may  be 
useful  for  prediction  for  different  job  types.  For  example,  tests  of 
multilimb  coordination  and  control  precision  have  relatively  high  validity 
for  predicting  educational  and  training  criteria  for  professional,  tech¬ 
nical,  and  managerial  jobs  (recall  that  most  of  these  validities  are  based 
on  studies  of  pilots),  while  tests  of  finger  dexterity,  manual  dexterity, 
and  wrist-finger  speed  appear  to  be  most  useful  for  predicting  educational 
and  job  proficiency  criteria  for  Industrial  jobs. 

The  validity  data  also  indicate  very  little  difference  in  the  predict¬ 
ability  of  different  criterion  categories.  While  the  overall  median  valid¬ 
ity  for  job  proficiency  criteria  (median  r-.20,  Af»l,358  validity  coeffi¬ 
cients)  Is  somewhat  greater  than  the  median  validity  for  educational  (me- 
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Median  Validity 
Weighted  Mean  Validity 


Table  28  (continued) 

Validity  Summary  for  All  Jobs  (N  =  2373  Validity  Coefficients) 
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Table  29  (continued) 

Median  Validities  bv  Job  Type  and  Criterion  Category 
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Table  29  (continued) 
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each  encry  la  thla  cabla  coaalsts  of  two  auabata.  Tha  daclaal  nuabar  la  tha  aadlan  validity  coalflclant  (or  that  ItiUeulii  payeboaotot 
ability  -  crltarlaa  catajory  -  Jab  typa  caablnatloa.  Iha  nuaber  In  yaraatheaaa  vhlch  follow*  la  tha  nuabar  a<  validity  coarflclaata  cab 
alatad  la  coap«U3|  tha  aadlaa. 


dl an  r». 13,  /V- 407  validity  coefficients)  and  training  (median  r-.15,  N- 603 
validity  coefficients)  criteria,  the  results  vary  considerably  from  job 
type  to  job  type  and  from  ability  to  ability. 

Table  29  shows  that  most  of  the  psychomotor  abilities  have  been  used 
successfully  as  predictors  (i.e.,  median  r«>.20)  In  at  least  one  job  type, 
for  example,  multilimb  coordination  has  been  an  effective  predictor  for 
professional,  technical,  and  managerial  jobs  and  for  protective  service 
jobs.  Manual  dexterity  has  demonstrated  validity  for  service  jobs  and 
Industrial  jobs.  For  those  abilities  which  have  not  evidenced  their  value 
as  predictors,  the  problem  may  be  a  lack  of  research.  For  example,  in 
total  only  28  validity  coefficients  are  available  for  rate  control,  only 
seven  are  available  for  aiming,  only  28  are  available  for  arm-hand  steadi¬ 
ness,  and  only  one  Is  available  for  speed  of  arm  movement.  Additional 
research  may  show  that  tests  of  these  abilities  are  also  effective  predic¬ 
tors  for  some  job  types. 

Finally,  Table  29  suggests  that  psychomotor  abilities  may  be  useful 
predictors  only  for  certain  job  types.  For  example,  across  all  abilities 
and  criteria,  the  median  validity  for  psychomotor  abilities  for  Industrial 
jobs  Is  .23  (/V-814  validity  coefficients).  Psychomotor  abilitias  have  also 
demonstrated  moderate  validity  for  professional,  technical,  and  managerial 
jobs  (median  r-.14,  N- 793  validity  coefficients),  clerical  jobs  (median 
r-.14,  /V-143  validity  coefficients),  protective  service  jobs  (median  r«.13, 
/M33  validity  coefficients),  service  jobs  (median  r-.K,  /V- 170  validity 
coefficients),  mechanical  and  structural  maintenance  jobs  (median  r».19, 
/M84  validity  coefficients),  electronics  jobs  (median  r-.13,  /V- 30  validity 
coefficients),  and  miscellaneous  jobs  (median  r-.18,  N- 91  validity  coeffi¬ 
cients).  These  job  types  would  Include  a  number  of  Army  MOS.  On  the  other 
hand,  the  median  validity  for  sales  jobs  was  only  .08  (/IM5  validity  coef¬ 
ficients),  Indicating  that  psychomotor  ability  tests  would  have  no  utility 
as  predictors  for  this  nonmilitary  job  type. 

Summary  of  the  Validity  Evidence 

Past  research  demonstrates  that  psychomotor  ability  tests  can  be  valid 
predictors  of  educational,  training,  and  job  proficiency  criteria  for  many 
job  types.  Indeed,  the  validity  data  presented  in  Tables  7-29  suggest  that 
sales  may  be  one  of  the  only  job  types  for  which  no  psychomotor  abilities 
have  any  predictive  validity. 

Test  development  efforts  should  be  guided  by  a  content-oriented  test 
development  strategy.  Military  psychologists  who  have  adopted  this  stra¬ 
tegy  In  the  past  (e.g.,  Eggenberger,  1976;  Guilford  &  Lacey,  1947;  Melton, 
1947)  have  met  with  good  success  In  predicting  job  and  training  perfor¬ 
mance.  Moreover,  content  valid  tests  typically  have  a  great  deal  of  face 
validity,  Increasing  the  likelihood  that  the  tests  will  be  accepted  by 
users  and  test  subjects. 

In  sum,  the  major  questions  regarding  the  usefulness  of  psychomotor 
tests  for  personnel  selection  and  classification  have  little  to  do  with  the 
tests’  validity  (cf.  North  &  Griffin,  1977;  Passey  &  McLaurln,  1966; 
Zeidner,  Martinek,  and  Anderson,  1961).  Rather,  the  questions  concern 
special  psychomotor  testing  Issues  (e.g.,  test  reliability,  cost 
effectiveness/utllity  of  .psychomotor  testing,  group  differences  in  psycho- 


119 


motor  test  scores,  the  fidelity  between  psychomotor  tests  and  the  criteria 
they  are  intended  to  predict).  These  questions  are  addressed  in  the  next 
section. 


120 


SECTION  V 


CURRENT  ISSUES  IN  RESEARCH  ON  PSYCHOMOTOR  ABILITIES 

Stability  and  Differential  Stability 

One  of  the  major  areas  of  current  psychomotor  ability  research  is  the 
stability  of  Individual  differences  In  psychomotor  test  scores  as  the  test 
task  Is  practiced. 

Some  of  the  Issues  involved  In  this  research  have  been  noted  pre¬ 
viously  In  the  discussion  of  Fleishman's  distinction  between  skills  and 
abilities.  Recall  that  Fleishman  (1960;  Fleishman  &  Hempel ,  1954b,  1955; 
Parker  &  Fleishman,  1962)  and  others  (e.g.,  Adams,  1953,  1957;  Hlnrlchs, 
1970)  have  found  that  the  abilities  accounting  for  variance  In  skilled 
motor  performance  change  with  practice.  In  general,  these  researchers  have 
found  that  correlations  between  cognitive  abilities  (e.g.,  verbal  ability, 
spatial  relations)  and  task  proficiency  decrease  as  the  task  Is  practiced, 
while  correlations  between  psychomotor  (e.g.,  speed  of  arm  movement)  and 
basic  perceptual  abilities  (e.g.,  reaction  time,  response  orientation)  and 
task  proficiency  Increase  slightly.  In  addition,  as  the  task  Is  practiced, 
there  Is  an  ever  Increasing  amount  of  variance  specific  to  the  task  Itself 
(Fleishman,  1975). 

This  skllls-abllltles  research  suggests  that  the  methods  typically 
used  to  assess  psychomotor  abilities  for  criterion-related  validity  re¬ 
search  may  be  deficient.  In  most  such  research,  subjects  are  given  at  most 
one  or  two  practice  trials  before  the  tests  are  administered  (e.g.,  Melton, 
1947;  U.S.  Department  of  Labor,  1970).  Typically,  tests  then  span  no  more 
than  four  to  six  trials  and  require  only  about  5  minutes  of  actual  testing 
time.  Fleishman’s  research  Indicates  that  the  resulting  test  scores  are 
likely  to  be  at  least  somewhat  contaminated  with  variance  attributable  to 
cognitive  abilities.  Most  of  this  variance  could  be  eliminated  If  subjects 
were  permitted  more  practice  time  prior  to  test  administration. 

Over  the  past  several  years,  the  Naval  Medical  Research  and  Develop¬ 
ment  Command  has  sponsored  a  number  of  studies  as  part  of  an  effort  to 
study  the  effects  of  adverse  environments  (e.g.,  vibration  and  ship  motion) 
on  task  performance  (Kennedy  &  Bittner,  1977;  Kennedy,  Carter,  &  Bittner, 
1980).  Research  on  the  impact  of  environmental  effects  almost  always 
requires  the  use  of  repeated-measures  designs  to  assess  pre-,  per-,  and 
post-exposure  effects  on  task  performance.  Results  from  repeated-measures 
designs  can  be  misleading,  however,  If  performance  Is  measured  on  a  task 
that  shows  practice  effects.  Therefore,  an  Initial  goal  of  this  research 
has  been  to  Identify  a  battery  of  Performance  Evaluation  Tests  for  Environ¬ 
mental  Research  (PETER)  which  can  be  used  In  repeated-measures  studies  of 
different  environments. 

According  to  Jones,  Kennedy,  and  Bittner  (1981),  tests  which  are 
Included  in  the  PETER  battery  must  stabilize  over  time.  That  is,  the  test 
task  may  show  practice  effects  initially,  but  "there  (must)  come  a 
point... after  which  practice  effects  no  longer  appear"  (p.  143).  Bittner 
(1979)  has  called  this  criterion  differential  stability.  "Differential 
stability  refers  to  the  point  In  practice  after  which  Individual  learning 
curves  are  parallel  and  at  most  slowly  increasing,  except  for  random  error; 
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after  this  point  all  intertrial  (interday)  correlations  are  the  same  except 
for  random  sampling  variations"  (Kennedy,  Bittner,  &  Jones,  1981,  p.  310). 
Differential  stability  also  implies  that  the  variance  among  subjects’  test 
scores  remains  constant  across  trials  and  that  the  correlations  between  any 
given  early,  "unstabilized"  trial  and  all  stabilized  trials  are  approxi¬ 
mately  equal  (Jones,  Kennedy,  &  Bittner,  1981). 

Since  not  all  tasks  stabilize  (l.e.,  show  differential  stability),  the 
PETER  researchers  were  first  Interested  in  identifying  a  stable  set  of 
tasks  for  the  PETER  battery.  In  addition,  for  tasks  that  were  stable, 

PETER  researchers  sought  to  determine  how  much  practice  was  required  before 
differential  stability  was  attained. 

To  date,  PETER  differential  stability  research  has  focused  on  broad 
cognitive  abilities,  such  as  grammatical  reasoning  (Carter,  Kennedy,  & 
Bittner,  1981)  and  arithmetic  computation  (Seales,  Kennedy,  &  Bittner, 
1980),  and  complex  perceptual -psychomotor  tasks  modeled  In  many  Instances 
after  actual  Navy  job  duties.  These  Include  a  navigation  plotting  task 
(Wlker,  Kennedy,  &  Pepper,  1983),  a  number  of  Atari  video  games  (Jones, 
Kennedy,  &  Bittner,  1981;  Kennedy,  Bittner,  Harbeson,  &  Jones,  1982),  and  a 
compensatory  tracking  task  (Kennedy,  Bittner,  &  Jones,  1981). 

Almost  all  of  the  tasks  the  PETER  researchers  have  examined  have 
eventually  demonstrated  differential  stability.  In  most  cases,  however, 
considerable  practice  was  required  before  the  tasks  stabilized.  For  ex¬ 
ample,  the  Atari  Air  Combat  Maneuvering  (ACM)  game  did  not  attain  differen¬ 
tial  stability  until  after  subjects  had  practiced  30  minutes  per  day  for 
six  days--and  the  ACM  game  became  stable  more  quickly  than  any  of  the  other 
perceptual -psychomotor  tasks  examined. 

These  results  may  be  surprising  In  light  of  the  high  psychomotor  test 
reliabilities  reported  by  Melton  (1947),  Fleishman  (1958),  and  others. 
Reliabilities  for  some  of  the  more  commonly  used  psychomotor  tests  are 
reported  In  Table  30.  Most  test-retest  reliabilities  for  Intertest  dura¬ 
tions  of  one  month  or  less  are  between  .70  and  .90,  Indicating  fairly  good 
reliability  for  these  psychomotor  tests.  How  can  these  results  be  recon¬ 
ciled  with  the  differential  stability  results  obtained  by  the  PETER  re¬ 
searchers? 

First,  it  Is  Important  to  note  that  differential  stability  Is  not  the 
same  as  high  Intertrial  correlations  or  high  test-retest  reliability. 
Indeed,  the  average  correlation  among  stabilized  trials  could  be  .60,  .40, 
or  even  .20  when  differential  stability  Is  attained.  Differential  stabil¬ 
ity  does  not  imply  that  measures  are  relatively  free  of  random,  unreliable 
variance.  Rather,  It  means  that  the  percentage  of  task  performance  vari¬ 
ance  which  Is  reliable  (l.e.,  common  across  days  or  trials)  does  not  change 
from  day  to  day  or  trial  to  trial  once  the  task  has  stabilized. 

Second,  test-retest  reliabilities  reported  by  the  PETER  researchers 
are  not  appreciably  different  from  those  reported  by  Melton  (1947), 
Fleishman  (1958),  and  others.  Reliabilities  for  some  of  the  PETER  psycho¬ 
motor  tests  are  summarized  In  Table  31.  The  median  reliability  for  sta¬ 
bilized  trials  for  these  seven  tests  is  .80.  This  is  very  similar  to  the 
median  test-retest  reliability  of  .78  for  the  psychomotor  tests  summarized 
In  Table  30.  (This  median  is  based  on  only  those  reliabilities  reported 
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liabilities  for  Selected  Psvchomotor  Tests 
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3-year  test -retest  Government  am-  U.  S.  Department 

reliability  ployees  (M  un-  of  Labor,  1970 
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Seliabilities  for  Selected  Psychowotor  Tests 
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.69  Odd-even  split-half  reliabll-  M=204  basic  Fleishaan,  1958 

Ity,  corrected  for  the  full  trainee  airswn 

length  of  the  test  (4  trials) 
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Steadiness  Aiming  Test  .96  Average  inter-trial  corrals*  N=461  aviation  Nelton,  1947 


O 

« 

'O 

m 

o» 

O* 

K. 

N. 

K 

■5  * 

• 

o* 

c  c 

c 

— 

•  m 

• 

m 

» 

B 

• 

c  x 

C 

X 

c 

c 

O  M 

O 

W 

o 

o 

.tf  — 

44 

•a* 

44 

44 

C  o 

— 

O  . 

— 

m  — 

o 

o 

o 

a.  u. 

X 

IH 

X 

X 

( 

—  M  C 

•  •  c 

1  w  c 

c  o 

AO  — 

c 

—  o  — 

—  o  — 

•  Oh* 

•  HO 

o 

•  4*  m 

•  44  * 

SCO 

•  •  c 

■ 

woe 

•  •  c 

£  O  K 

—  TJ  H 

o  c 

•  TJ  44 

•  *0  44 

«  s 

o  — 

— 

—  im 

o  o  o 

C  TJ  h 

•  • 

O  TJ  44 

O  TJ  44 

M 

c  X  u 
<*•  Cl  u 

M 

3  C  O 

• 

c  c  o 

c  c  o 

V 

44 

•  — 

X  o 

3  •  — 

3  fl  — 

c 

o  o 

c 

<M  O  — 

o 

u  — 

u  — 

• 

K1  MU. 

o 

—  a 

c 

>o  a 

o  a 

TJ 

O 

TJ 

O*  TJ 

o  — 

4-  TJ 

rg  TJ 

3 

IM  TJ  C 

3 

—  o  c 

0) 

OJ  • 

«#  O  C  01 

KT  o  c 

— 

If.  C  — 

44 

N  —  O 

c 

N  C 

N  —  O  C 

M  —  O 

1  fl 

*1  ■  < 

• 

Blc  «H 

M\  H 

»«■  «H  — 

*I«H  «H 

«H 

•  4«s 

•  ^ 

•  O 

—  • 

—  A 

c 

O  O  X 

4  —  • 

X  —  • 

X  o  4* 

•  3  — 

0  3  — 

44  o» 

—  «H  C 

4*  L 

•  c 

— •  4# 

—  44 

C  W  « 

o  o 

o  o 

O  fl  — 

C  X  ® 

C  X 

44 

• 

4*  w 

H  — 

« 

fl  4-«  — 

«H 

«H 

o 

«  - 

—  C  4* 

—  1-4* 

44 

44 

c  O  3 

•  o  • 

•  O  A 

• 

fl 

X  O 

X  4-  o 

C 

o 

U  6* 

44 

4* 

i 

44 

fl  O 

H  TJ 

H  TJ 

44 

o 

X 

—  O  O 

—  o  o 

• 

c 

c  o  44 

—  ♦*  X 

—  4*  x 

o 

• 

o 

a.  u  h 

Q.  U  4* 

44  >. 

44  V 

IINL 

a  o 

•  O 

44 

fl  44 

3—0 

C  H 

L  «H 

o  — 

O  — 

—  fl  h- 

44 

C  c  o 

c  C  o 

44  w 

44  — 

>  5  *z  *o 

M 

c 

o  o 

o  o 

•  — 

•am 

o 

3 

>  O  X 

>  o  X 

—  X 

M  X 

**  « 

44 

O 

O  w 

O  44 

TJ  • 

O  fl 

c  4* 

C 

•  *  01 

•  *  a» 

O  — 

o  — 

OKU 

o 

Jd 

2  >•  c 

TJ  >»  C 

1  w 

3  — 

—  —OX 

C 

■Oho 

TJ  4*  o 

!  o 

•  o 

V  «  L 

44 

3 

—  c 

4*  L. 

* 

o 

o 

w 

to 

tO 

o* 

to 

to 

K 

e 

e 

4* 

M 

o 

H' 

c 

o 

44 

fl 

c 

Coord 

1947) 

?  c 

•  o 

Z  V 

o  o 

3  X 

*-  w 

129 


.87  28-day  test-retest  M.=700  unclassl*  Nelton,  1947 

reliability  fied  candidates 

for  pi  lot  train- 


Table  30  (Continued) 


■ 

4.  £. 

4>  M 
M  —  O 
C-  «J  O 
«  —  ©» 


m  **  • 
m  m  u 

—  -p 
u  — 

C  P  W 

3  C  O 

•  — 

IM  U  — 

•-  a 

*-  TJ 

rsj  4>  u  a» 
m ~  O  C 
s|*»  — 


•  «  c 

—  41  — 

«  • 

uni. 

•  73  ** 

O  T)  w 

c  c  o 
3  a  — 
o  — 

o  a 

O  T3 

in  4i  l  ai 
H  •-  O  C 
M\^  «*■  — 


c  u 

I 

■  4.  o 

X  o  « 

«  c 

41  O  4) 
b  £  U  (A 
*-  Q.  t_  *-• 
O  O  C 
n  MIL  II 

O  T> 

N  T3  b  3 

M ,C  •-  *- 

xl  «  <  M 


41  T)  £ 

C  *4-  4>  41  ♦- 

•  ♦-  «C 

•  41  O 

41  b  L  V 
.C  3  U  C  f 

*-  C  O  4»  3 

u  «  « 

Cl  • 

41  *  i.  41 

•  oi  -M  a.c 

S  C  «  41  V 

«  -  41  U 

f  L  ««  *- 

i  3  o  o 

T>  •  * 

C  J1  X 

—  4-  —  4> 

v  o  **>  a-<  l 

■  u  o  c  —  o 

—  VI  4>  X)  U 

•  |A  —  •  M 

4-  TJ  — 

u  v  -n  c  —  o 

o  *-  c  o  v  s 


-  b  V 

b.  O 

*d  *.  ** 

b.  M 

•  *0  4) 

V  II  V 

C  b- 

—  O  V 
«  £ 

c  u 

•  b 

V  O  b- 

X  u  o 


•  0  41 
X5  *■  4l 

♦d 

w  -o 

—  4»  41 

—  *d  .C 

a  u  v 

«  41 

b  **- 

C  b  o 
4»  O 
>  b>  jc 
41  *» 

•  •  01 

2  >*  c 

ir  • 


c  «*»  «  — 

4>  4»  ** 

41  41  bd  —  M 

3  X  —  41 

v  v  •  3  v 

i 

O  4»  — 

e  h-  .c  ■ 

O  4T  O  *d 

—  O  40 

*d  «  «|  i.  w 

•  4>  4>  O 

—  «d  ^  «d 

V  4.  3  M 

b,  O  C  73  41 

41  Id 

O  ■  ~ 

ox  o  4» 

41  41  41  X 

C  4-  V  4-  *d 

fl  O  b  b 

VU-CO*- 

X  M  bd  o  O 


130 


Table  31 


Rail  ability*  of  PETER  Psychowotor  Tests 


Test 

Reliability 

Samole  S1zeb 

Reference 

Atari  Air  Combat  Maneuvering 

.93 

N-22 

Jones,  Kennedy,  and 

Bittner,  1981 

Atari  Slalom 

.60 

M-15 

Kennedy  and  Bittner,  1980 

Compensatory  Tracking 

.78 

N-18 

Kennedy,  Bittner,  and 

Jones,  1981 

Critical  Tracking 

.85 

N-18 

Damos,  Kennedy,  and 

Bittner,  1980 

Spoke  Task 

.80 

N-18 

Bittner,  Lundy,  Kennedy, 
and  Harbeson,  1982 

Tapping 

•  85 

N-18 

Kennedy,  Bittner,  and 
Elnbender,  1980 

Trail  Making 

.40 

N-18 

Kennedy,  Bittner,  and 
Elnbender,  1980 

a.  The  reliability  represents  the  average  correlation  between  stabllzed  dally  test  scores. 
This  reliability  Is  actually  a  form  of  test-retest  reliability,  with  the  Interval 
between  testing  sessions  ranging  from  1  to  14  days,  depending  on  the  particular  test 
and  the  number  of  days  required  for  the  test  to  stabilize. 

b.  All  subjects  were  Navy  enlisted  men. 
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for  intertest  durations  of  one  month  or  less.) 

Nevertheless,  the  PETER  researchers  have  suggested  that  task  perfor¬ 
mance  be  assessed  only  after  the  task  has  stablized.  In  part,  this  is 
because  data  gathered  via  repeated-measures  designs  are  much  more  difficult 
to  analyze  if  differential  stability  has  not  been  attained  (Jones,  Kennedy, 
&  Bittner,  1981).  In  part,  however,  it  also  reflects  a  concern  that  the 
task  is  changing  and/or  the  abilities  used  by  the  subject  to  perform  the 
task  are  changing  prior  to  the  time  at  which  the  task  stabilizes  (cf.  Hulin 
&  Alvares,  1971).  If  either  or  both  of  these  is  true,  then  the  task  may 
not  be  measuring  the  same  ability  or  abilities  across  assessment  trials. 

While  this  may  at  first  glance  appear  to  be  a  serious  problem  which 
jeopardizes  the  criterion-related  validity  of  many  psychomotor  tests,  evi¬ 
dence  from  a  number  of  sources  suggests  that  this  really  Is  not  a  threat  to 
the  use  of  psychomotor  tests  as  predictors  of  job  performance. 

First,  even  without  differential  stability,  test-retest  reliability 
for  the  PETER  tests  remains  quite  high  across  15  days  (e.g.,  .78  for  the 
Navigation  Plotting  task  and  .69  for  the  Atari  Air  Combat  Maneuvering 
task).  These  correlations  suggest  that  the  abilities  required  to  perform 
an  unstabilized  task  are  by  and  large  quite  similar  to  the  abilities  re¬ 
quired  to  perform  the  task  after  It  has  stabilized. 

Second,  Fleishman’s  research  cited  above  indicates  that  the  variance 
not  shared  by  the  unstabilized  task  and  the  stabilized  task  is  primarily 
attributable  to  broad  cognitive  abilities  such  as  verbal  ability 
(Fleishman,  1975).  To  the  extent  that  the  criterion  job  performance  mea¬ 
sure  Is  uncorrelated  with  cognitive  abilities,  traditional  psychomotor 
ability  assessment  procedures  may  yield  an  attenuated  estimate  of  the  true 
correlation  between  psychomotor  abilities  and  job  performance  criteria. 
Nevertheless,  the  test-retest  reliabilities  reported  in  Tables  30  and  31 
and  the  research  on  ability  correlates  of  psychomotor  task  performance 
cited  previously  Indicate  that  performance  on  even  the  first  few  trials  of 
a  psychomotor  task  Is  heavily  Influenced  by  psychomotor  abilities.  For 
this  reason,  AAF  researchers  found  that  their  psychomotor  tests  contributed 
unique  variance  to  the  prediction  of  pilot  training  attrition  (Melton, 
1947). 

Thus,  while  stabilized  tasks  might  be  more  valid  as  predictors  of  job 
performance  for  jobs  which  are  highly  dependent  upon  psychomotor  abilities 
(l.e.,  because  there  will  be  relatively  little  variance  In  the  predictor 
which  Is  attributable  to  non-psychomotor  abilities),  unstabilized  tasks 
will  generally  be  almost  as  valid.  In  addition,  assessment  of  unstabilized 
tasks  Is  much  more  economical  than  assessment  of  stabilized  tasks.  Assess¬ 
ment  of  unstabilized  tasks  precludes  the  need  to  expend  valuable  subject 
time  on  task  practice.  If  subject  time  is  11mited--as  it  almost  always 
1s--use  of  unstabilized  tasks  allows  assessment  of  a  much  broader  range  of 
tasks  and  abilities. 


Utility  of  Psychomotor  Predictors 

Lack  of  validity  data  for  psychomotor  tests  and  lack  of  information 
about  the  standard  deviation  of  job  performance  in  dollars  for  various  Army 
job  groups  preclude  a  formal  analysis  of  the  utility  of  psychomotor  tests 
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as  guides  for  job  classification  of  Army  enlisted  personnel  (Brogden,  1946, 
1949;  Cronbach  &  Gleser,  1965).  The  validity  information  summarized  pre¬ 
viously,  however,  suggests  that  psychomotor  tests  will  indeed  be  effective 
as  predictors  of  job  performance  for  many  Army  jobs.  In  addition,  critical 
incident  data  collected  for  first-tour  soldiers  Illustrates  that  ineffec¬ 
tive  job  performance  can  be  quite  costly  to  the  Army  in  terms  of  deadlined 
or  Irreparably  damaged  equipment  and  vehicles  and  personnel  time  lost  to 
maintenance  and  repairs. 

Nevertheless,  there  are  at  least  two  reasons  that  psychomotor  testing 
might  lack  utility  for  the  Army.  First,  it  may  be  that  psychomotor  tests 
contribute  little  or  no  unique  variance  to  the  prediction  of  job  perfor¬ 
mance.  And,  second,  it  may  be  that  psychomotor  testing--or,  at  the  very 
least,  psychomotor  apparatus  test1ng--1s  so  expensive  that  any  Increments 
in  the  prediction  of  job  performance  are  more  than  offset  by  increases  in 
costs  associated  with  testing.  Each  of  these  possibilities  will  be  ad¬ 
dressed  below. 

Correlations  between  Psvchomotor  and  Cognitive-Perceptual  Abilities. 

To  the  extent  that  psychomotor  abilities  are  only  moderately  or  weakly  cor¬ 
related  with  cognitive-perceptual  abilities,  the  utility  of  psychomotor 
testing  will  be  enhanced.  This  is  because  low  correlations  between  psycho¬ 
motor  and  cognitive-perceptual  abilities  Indicate  that  psychomotor  abili¬ 
ties  can  contribute  unique  variance  to  the  prediction  of  job  and  training 
performance.  If,  however,  the  correlations  between  psychomotor  abilities 
and  cognitive-perceptual  abilities  are  high,  It  will  mean  that  psychomotor 
ability  tests  would  contribute  very  little  above  and  beyond  cognitive 
ability  tests  to  the  prediction  of  job  and  training  performance.  Since 
psychomotor  testing  tends  to  be  more  expensive  than  cognitive-perceptual 
testing  (i.e.,  because  of  the  need  for  special  test  apparatus  and/or  indi¬ 
vidual  test  administration),  high  correlations  between  psychomotor  and 
cognitive-perceptual  abilities  would  suggest  that  psychomotor  testing  would 
not  be  worthwhile  to  the  Army.  Under  those  conditions,  the  payoff  from 
psychomotor  testing  In  terms  of  Improved  prediction  and  classification 
would  be  negligible  at  best. 

To  determine  typical  correlations  between  various  psychomotor  and  cog¬ 
nitive-perceptual  abilities,  the  articles,  reports,  and  manuals  identified 
during  the  literature  search  were  examined  In  an  effort  to  locate  intercor¬ 
relation  matrixes  that  Included  both  psychomotor  and  cognitive-perceptual 
tests.  Five  such  sources  were  Identified  (Fleishman,  1954b;  Melton,  1947; 
Michael,  1949;  Parker  &  Fleishman,  1960;  U.S.  Department  of  Labor,  1970). 

In  total,  these  five  sources  contained  17  Intercorrelation  matrixes. 

Each  psychomotor  test  used  In  these  studies  was  classified  into  one  of 
the  nine  psychomotor  ability  constructs  (complex  psychomotor  predictors 
were  not  included  In  this  classification  scheme)  and  each  cognitive- 
perceptual  ability  test  was  classified  into  one  of  the  12  cognitive- 
perceptual  ability  constructs  used  in  the  validity  summary  tables  in  the 
literature  review  for  cognitive  abilities.  In  some  cases,  tests  did  not 
fit  neatly  into  any  of  the  21  ability  constructs  and  therefore  were  ex¬ 
cluded  from  further  analysis. 

Correlations  were  then  tabulated  between  each  psychomotor  ability  and 
each  cognitive-perceptual  ability.  In  total,  1,066  correlations  were  tabu- 
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lated.  Among  the  psychomotor  abilities,  the  number  of  correlations  ranged 
from  337  for  multlllmb  coordination  to  zero  for  aiming.  Within  the 
cognitive-perceptual  ability  domain,  the  number  of  correlations  ranged  from 
324  for  spatial  ability  to  zero  for  fluency. 

Median  correlations  were  then  computed  between  each  psychomotor  abil¬ 
ity  and  each  cognitive-perceptual  ability.  Table  32  summarizes  these 
median  correlations.  In  general,  the  table  shows  that  the  two  ability 
domains  do  not  overlap  greatly.  Of  the  61  median  correlations  reported  in 
the  table,  only  10  are  greater  than  .20,  and  over  40%  (25)  are  .10  or  less. 
Only  four  psychomotor  abilities  appear  to  be  even  moderately  correlated 
with  two  or  more  cognitive-perceptual  abilities. 

Multlllmb  coordination  Is  one  of  these  four  psychomotor  abilities. 
Median  correlations  for  multlllmb  coordination  Include  .29  with  mechanical 
aptitude  (/IM5  correlations)  and  .23  with  perceptual  speed  and  accuracy 
(/V-64  correlations). 

The  moderately  high  correlation  between  multilimb  coordination  and 
mechanical  aptitude  may  reflect  a  co-occurrence  of  learning  experiences. 
That  Is,  people  who  spend  a  lot  of  time  building  and/or  repairing  mechan¬ 
ical  equipment  are  likely  to  learn  mechanical  principles.  In  addition, 
they  are  likely  to  spend  a  lot  of  time  working  in  situations  which  require 
the  careful  coordination  of  two  hands  or  hands  and  feet.  Thus,  the  corre¬ 
lation  between  these  two  abilities  Is  not  surprising.  Nor  Is  the  moderate 
correlation  between  multlllmb  coordination  and  spatial  ability  (median 
r-.18,  AM07  correlations).  Most  tests  of  multlllmb  coordination  require 
the  subject  to  visualize  and/or  monitor  a  stimulus  pattern.  The  moderate 
correlations  of  multlllmb  coordination  with  perceptual  speed  and  accuracy 
and  perception  (median  r-.19,  N- 5  correlations)  may  also  be  due  to  this 
perceptual  requirement  of  multlllmb  coordination  tests. 

None  of  these  median  correlations  Is  very  high,  however;  all  are  less 
than  .30.  This  suggests  that  there  Is  still  a  great  deal  of  variance  In 
measures  of  multilimb  coordination  which  Is  not  tapped  by  cognitive-percep¬ 
tual  tests.  Moreover,  the  validity  summary  tables  described  previously 
(e.g.,  Table  29)  show  that  multlllmb  coordination  has  been  a  valid  pre¬ 
dictor  of  training  performance  for  at  least  two  job  types.  Therefore,  the 
correlations  between  multlllmb  coordination  and  these  three  cognitive- 
perceptual  ability  do  not  pose  a  serious  threat  to  the  utility  of  testing 
for  multlllmb  coordination. 

Finger  dexterity  Is  the  second  psychomotor  ability  which  Is  moderately 
correlated  with  at  least  two  cognitive-perceptual  abilities  (median  r-. 26 
with  perception,  Af-8  correlations;  median  r-.20  with  perceptual  speed  and 
accuracy,  W-29  correlations).  The  moderate  correlations  between  finger 
dexterity  and  these  two  cognitive-perceptual  abilities  are  probably  due  to 
the  perceptual  and  visualization  demands  of  finger  dexterity  tasks.  Fur¬ 
ther  evidence  for  this  is  supplied  by  the  median  correlation  of  .16  between 
finger  dexterity  and  spatial  ability  measures  [N* 52  correlations).  Never¬ 
theless,  these  median  correlations  are  only  moderate  at  best,  and  the 
median  validity  coefficients  reported  in  Table  29  suggest  that  finger 
dexterity  would  be  a  valid  predictor  of  educational  and  school  achievement, 
training  performance,  and  job  proficiency  in  several  job  types.  Thus,  it 
would  seem  prudent  to  continue  considering  finger  dexterity  for  inclusion 
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In  any  new  selection  and  classification  test  battery. 

Manual  dexterity  is  the  third  psychomotor  ability  which  appears  to  be 
moderately  correlated  with  more  than  one  cognitive-perceptual  ability. 
Median  correlations  involving  manual  dexterity  include  .32  with  perception 
(/V-8  correlations),  .24  with  perceptual  speed  and  accuracy  (AM2  correla¬ 
tions),  and  .21  with  memory  (AM  correlation).  These  median  correlations 
are  very  similar  to  those  between  finger  dexterity  and  these  three  cog¬ 
nitive-perceptual  abilities.  The  correlations  indicate  that  perceptual 
abilities  are  also  relevant  to  manual  dexterity  tasks.  The  correlations 
are  not  so  high,  however,  to  eliminate  manual  dexterity  from  consideration 
as  a  predictor  in  the  current  research,  especially  given  the  evidence 
presented  in  Table  29  indicating  the  potential  validity  of  manual  dexterity 
measures  for  predicting  training  performance  and  job  proficiency  criteria 
in  several  job  types. 

The  final  psychomotor  ability  with  moderate  or  high  correlations  with 
two  or  more  cognitive-perceptual  abilities  is  wrist-finger  speed.  Almost 
all  of  the  correlations  between  wrist-finger  speed  and  cognitive-perceptual 
abilities  presented  in  Table  32  are  based  on  studies  with  the  GATB.  Most 
of  these  correlations  are  quite  high  (median  r-.38  with  perceptual  speed 
and  accuracy,  AM7  correlations;  median  r«.26  with  perception,  AM 4  cor¬ 
relations;  median  r«.25  with  memory,  AM  correlation;  median  r-.24  with 
number  facility,  AM8  correlations;  and  median  r-.29  with  reasoning,  AM4 
correlations). 

One  possible  explanation  for  these  high  correlations  is  that  wrist- 
finger  speed  is  a  measure  of  marking  speed.  In  the  GATB,  one  of  the  wrist- 
finger  speed  tests  is  simply  a  measure  of  the  number  of  circles  a  subject 
can  mark  in  a  fixed  period  of  time.  To  the  extent  that  a  subject’s  percep¬ 
tual  speed  and  accuracy,  perception,  memory,  number  facility,  and  reasoning 
test  scores  are  affected  by  ability  to  mark  an  answer  sheet  quickly,  it 
would  be  expected  that  correlations  between  these  abilities  and  wrist- 
finger  speed  would  be  high. 

The  correlation  of  .38  with  perceptual  speed  and  accuracy  is  particu¬ 
larly  easy  to  explain  in  this  manner.  For  most  perceptual  speed  and  accu¬ 
racy  tests,  subjects  are  presented  with  two  character  strings  (e.g.,  names, 
addresses,  nonsense  syllables)  and  asked  to  determine  whether  the  strings 
are  exactly  the  same  or  different.  In  most  cases,  any  differences  between 
the  two  strings  are  only  slight.  Computer  versions  of  this  test  demon¬ 
strate  that  most  subjects  can  determine  the  correct  answer  to  each  item  in 
just  a  fraction  of  a  second.  Indeed,  it  probably  takes  most  subjects  much 
more  time  to  mark  their  answer  sheets  than  to  determine  the  correct  answer. 
Thus,  it  is  not  surprising  to  see  the  high  correlation  between  wrist-finger 
speed  and  perceptual  speed  and  accuracy.  The  correlation  almost  certainly 
indicates  that  much  of  the  variance  in  paper-and-pencil  perceptual  speed 
and  accuracy  tests  is  attributable  to  marking  speed  (i.e.,  wrist-finger 
speed)  rather  than  to  perceptual  processes.  The  same  could  probably  be 
said  of  most  other  highly  speeded  paper-and-pencil  tests. 

Therefore,  the  moderately  high  correlations  between  wrist-finger  speed 
and  cognitive-perceptual  abilities  can  probably  be  attributed  in  large 
measure  to  test  marking  variance  embedded  in  the  cognitive-perceptual 
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ability  tests.  Since  the  Army  is  seriously  considering  changing  to  comput¬ 
erized  administration  of  most  of  its  selection  and  classification  tests,  it 
is  unlikely  that  future  tests  of  these  cognitive-perceptual  abilities  will 
be  so  highly  correlated  with  wrist-finger  speed.  Hence,  it  is  difficult  to 
justify  excluding  tests  of  wrist-finger  speed  from  future  selection  and 
classification  batteries  simply  on  the  basis  of  these  correlations.  More¬ 
over,  Table  29  shows  that  wrist-finger  speed  is  a  valid  predictor  of 
training  performance  and  job  proficiency  in  several  job  types,  indicating 
that  wrist-finger  speed  tests  may  have  high  utility  for  the  Army. 

One  particularly  interesting  fact  emerging  from  Table  32  is  that 
correlations  between  psychomotor  abilities  and  abilities  which  are  highly 
related  to  general  Intelligence  (e.g.,  verbal  comprehension  and  reasoning) 
are  generally  quite  low.  For  example,  excluding  wrist-finger  speed,  median 
psychomotor  ability  correlations  with  verbal  comprehension  range  from  -.07 
for  speed  of  arm  movement  (/V-l  correlation)  to  .12  for  rate  control  (W-3 
correlations).  These  median  correlations  suggest  that  there  is  virtually 
no  relationship  between  psychomotor  abilities  and  g. 

In  sum,  Table  32  provides  strong  evidence  that  the  ability  variance' 
tapped  by  psychomotor  ability  tests  Is  different  from  that  tapped  by 
cognitive-perceptual  ability  tests.  Taken  together,  Tables  29  and  32 
demonstrate  that  psychomotor  ability  tests  can  contribute  unique  variance 
to  the  prediction  of  job  and  training  performance.  This  suggests  that 
psychomotor  ability  testing  may  Indeed  have  high  utility  for  the  Army. 

Psvchomotor  Test  Administration.  Throughout  most  of  the  history  of 
psychomotor  testing,  there  have  been  two  options  for  test  administration 
format:  paper-and-pencll  measures  and  mechanical  apparatus  measures. 

Psychologists,  of  course,  have  long  been  enamored  with  paper-and- 
pencll  tests.  Paper-and-pencll  tests  provide  psychologists  with  the  oppor¬ 
tunity  to  control  almost  perfectly  the  presentation  of  test  stimuli  to 
subjects  and  to  ensure  that  all  subjects  are  provided  with  exactly  the  same 
test  instructions  and  test  conditions. 

Much  of  this  control  is  lost  with  apparatus  tests.  As  a  result, 
mechanical  apparatus  tests  have  several  disadvantages  In  comparison  with 
paper-and-pencll  tests. 

Perhaps  the  most  serious  disadvantage  of  mechanical  apparatus  tests  is 
the  lack  of  standardization.  Two  units  of  the  same  apparatus  may  be  quite 
different  with  respect  to  actual  operation.  Springs  may  be  new  and  re¬ 
silient  on  one  unit  and  old  and  unresponsive  on  the  second.  Nuts  and  bolts 
may  be  tight  on  one  unit  and  loose  on  the  second.  Contact  points  on 
switches  may  be  clean  and  new  on  one  unit  and  old  and  corroded  on  the 
second.  Such  differences  may  seem  trivial.  Yet,  when  one  considers  that 
some  of  the  more  complicated  mechanical  apparatus  consists  of  scores  of 
springs,  nuts,  bolts,  and  switches,  one  begins  to  appreciate  that  the 
effects  of  these  apparatus  unit  differences  might  be  quite  large  indeed. 
Confirmation  of  this  was  provided  by  research  conducted  by  the  AAF  during 
World  War  II  (Melton,  1947).  Researchers  found  numerous  correlations  be¬ 
tween  calibration  measurements  on  apparatus  and  mean  test  scores  on  the 
various  units  of  that  apparatus. 
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A  second  major  disadvantage  of  apparatus  tests  is  that  they  frequently 
break  down.  This  often  results  in  missing  or  invalid  data.  In  addition, 
since  broken  apparatus  units  are  not  as  easy  to  replace  as  damaged  or 
misprinted  paper-and-pencil  tests,  broken  apparatus  ui.its  may  play  havoc 
with  testing  schedules. 

Because  of  the  problems  associated  with  apparatus  testing,  psycholo¬ 
gists  have  generally  been  very  reluctant  to  use  apparatus  tests  unless  no 
alternative  measures  of  the  target  abilities  were  available.  As  Melton, 
(1947)  wrote  regarding  the  development  of  apparatus  tests  by  the  AFF  Avia¬ 
tion  Psychology  Program  during  World  War  II,  "At  no  time  was  there  the 
intention  of  measuring  with  apparatus  some  function  which  could  be  measured 
as  meaningfully  by  the  marking  of  an  IBM  answer  sheet"  (p.  54). 

Nevertheless,  as  was  noted  previously  In  the  taxonomy  section,  there 
appear  to  be  several  abilities  (e.g.,  multi limb  coordination,  control 
precision,  rate  control,  finger  and  manual  dexterity)  which  defy  measure¬ 
ment  via  paper-and-pencil  tests  (Cronbach,  1970).  To  assess  these  abili¬ 
ties,  there  appears  to  be  no  alternative  to  apparatus  testing. 

With  respect  to  utility,  the  major  Issue  is  whether  the  problems 
associated  with  apparatus  testing  can  be  compensated  by  the  gains  In 
criterion-related  validity  resulting  from  the  apparatus  test  scores.  While 
they  apparently  never  conducted  any  formal  analysis  of  apparatus  test 
utility,  the  military  decided  after  World  War  II  that  apparatus  testing  was 
not  worthwhile  (Passey  &  McLaurln,  1966).  This  occurred  in  spite  of  data 
collected  during  World  War  II  which  demonstrated  that  the  validity  of  many 
apparatus  tests  was  quite  high. 

Research  on  the  selection  of  helicopter  pilots  by  Zeidner,  Martlnek, 
arj  Anderson  (1958)  and  Fleishman’s  continuing  research  into  the  taxonomy 
of  psychomotor  abilities  sparked  new  interest  in  apparatus  testing  in  the 
late  1950s.  One  result  of  that  interest  was  renewed  research  Into  the 
reliability  of  apparatus  tests  by  Dobbins,  Martlnek,  Anderson,  and 
Rosenberg  (1961).  While  Dobbins  et  al.  were  encouraged  by  the  increments 
in  predictive  validity  produced  by  psychomotor  apparatus  tests  in  the 
selection  of  Army  helicopter  pilots,  they  were  generally  pessimistic  re¬ 
garding  the  utility  of  such  tests.  In  their  words: 

Results  of  the  study  reemphasized  the  need  for  caution  In  the  use 
of  apparatus  tests.  The  study  reaffirmed  the  fact  that  indivi¬ 
dual  differences  in  machines  do  occur  and  that  scores  tend  to 
fluctuate  significantly  with  continued  use  of  the  same  machine. 

A  statistical  solution  may  be  the  only  feasible  means  of  com¬ 
pletely  neutralizing  apparatus  differences.  Ideally,  however, 
score  fluctuations  through  continuing  machine  use  should  be 
handled  by  proper  maintenance  and  not  by  statistical  conversion. 
(Dobbins  et  al.,  1961,  p.  9) 

The  need  for  continuous  apparatus  maintenance,  as  well  as  the  need  to 
conduct  all  testing  through  centralized  facilities,  added  greatly  to  the 
expense  associated  with  apparatus  testing  and  largely  nullified  ary  gains 
in  utility  associated  with  these  tests. 

In  spite  of  these  consistent  negative  findings  with  respect  to  appar- 
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atus  test  utility,  psychologists  involved  in  pilot  selection  continued  to 
be  interested  in  including  apparatus  tests  In  their  selection  batteries. 

For  example,  reviews  of  the  psychomotor  ability  literature  by  Passey  ind 
McLaurln  (1966),  Hunter  (1975),  and  Imhoff  and  Levine  (1981)  all  resulted 
in  the  recommendation  that  apparatus  tests  be  included  in  future  Air  Force 
selection  and  classification  batteries. 

Events  wth  perhaps  the  greatest  Impact  on  apparatus  testing,  however, 
occurred  during  the  1970s.  Breakthroughs  in  microprocessor  technology 
resulted  In  a  now  generation  of  solid  state  apparatus  tests. 

Among  the  first  researchers  to  experiment  with  computerized  apparatus 
tests  were  Sanders,  Valentine,  and  McGrevey  (1971).  They  developed  new 
versions  of  the  Two-Hand  Coordination  Test  and  the  Complex  Coordination 
Test.  (See  Appendix  A  for  pictures  ,ind  descriptions  of  both  the  original 
mechanical  and  the  new  computerized  versions  of  these  two  tests.)  For  both 
tests,  Sanders  et  al.  used  computer  graphics  video  displays  to  present  the 
test  stimuli  to  subjects.  Subjects  were  then  required  to  perform  a 
tracking  task  using  either  a  pair  of  joy  sticks  (fur  the  Two-Hand  Coordina¬ 
tion  Test)  or  a  joy  stick  and  a  rudder  control  (for  the  Complex  Coordina¬ 
tion  Test).  Sanders  et  al .  estimated  the  parallel  forms  reliability  to 
be  .81  for  the  Two-Hand  Coordination  Test  and  .92  for  the  Complex  Coordina¬ 
tion  Test.  Sanders  et  al.  also  found  that  the  tests  did  not  correlate 
highly  with  any  of  the  cognitive-perceptual  tests  or  composites  In  the  Air 
Force  Officer  Qualifying  Test  (AFOQT).  In  a  follow-up  to  this  Initial 
study,  McGrevey  and  Valentine  (1974)  found  that  the  Complex  Coordination 
Test  was  extremely  effective  as  a  predictor  of  graduation  from  undergrad¬ 
uate  pilot  training  (r-.44,  /V- 121,  /x.05).  By  contrast,  a  regression- 
derived  prediction  equation  Involving  all  13  of  the  tests  comprising  the 
AFOQT  yielded  a  multiple  correlation  of  only  .37.  The  predictive  validity 
of  the  Two-Hand  Coordination  Test  was  not  quite  as  high  as  that  of  the 
Complex  Coordination  Test  (r-.18,  121,  ns). 

Since  the  pioneering  research  of  Sanders,  Valentine,  and  McGrevey,  a 
number  of  other  psychomotor  tests  have  been  adapted  for  the  computer.  In 
some  cases,  these  tests  are  rather  direct  measures  of  Fleishman’s  multilimb 
coordination,  rate  control,  ana  control  precision  abilities  (e.g.,  the 
Complex  Coordination  Test  developed  by  Myers  et  al.,  1982).  In  other 
Instances,  the  tests  are  rather  elaborate  job  simulations  designed  to 
predict  performance  In  specific  jobs  (e.g.,  the  tests  using  the  Willey 
"Burst-on-Target"  Simulator  developed  by  Biers  &  Sauer,  1982,  which  are 
used  to  predict  tank  crew  rformance).  Table  33  contains  a  list  of  some 
of  the  computerized  psychomotor  tests  currently  being  used  and  studied  by 
the  U.S.  military.  It  should  be  noted  that  none  of  the  tests  listed  In 
Table  33  are  measures  of  manual  or  finger  dexterity.  To  date,  no  one  has 
succeeded  in  producing  a  computerized  dexterity  test.  As  a  result,  there 
Is  currently  no  alternative  to  traditional  apparatus  measures  for  these  two 
abilities. 

Computerized  apparatus  tests  have  several  advantages  over  mechanical 
apparatus  tests. 

First,  the  computerized  tests  tend  to  be  much  more  reliable.  This  is 
because  the  solid  state  components  used  in  these  apparatus  are  much  less 
affected  by  wear  and  tear  than  the  mechanical  parts  used  in  the  older 
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Table  33 


Computerized  Psvchomotor  Tests  in  Use  in  the  Military 


Test 

Location 

Machine 

Complex  Coordination 
(stick  and  rudder) 

AFHRL 

Analog  apparatus 

Complex  Coordination 
(two  tracking  tasks) 

NAMRL 

Analog  apparatus3 

Computerized  Target 
Engagement 

Ft.  Knox 

Apple  II 

Critical  Tracking 

NAMRL 

PDP-11 

Gunner  Tracking  Task 
(using  the  Willey 
Burst-on-Target 
Simulator) 

Ft.  Knox 

Analog  apparatus 

Helicopter  Simulator 

Ft.  Rucker 

Analog  apparatus 

One-Dimensional  Compen¬ 
satory  Tracking 

NAMRL 

PDP-11 

Perceptronics  Simulator 

Ft.  Knox 

Analog  apparatus 

Psychomotor  Tracking  Task 

Ft.  Knox 

Apple  II 

Tank  Turret  Simulator 

Ft.  Knox 

Analog  apparatus 

Tank  Video  Game 

(b) 

Apple  II 

Target  Acquisition  Task 
(using  the  Willey 
Burst-on-Target 
Simulator) 

Ft.  Knox 

Analog  apparatus 

Two-Dimensional  Compen¬ 
satory  Tracking 

Ft.  Rucker 

Apple  II 

Two-Hand  Coordination 

AFHRL 

Analog  apparatus 

a.  NAMRL  is  currently  adapting  this  to  an  Apple  II  computer  with 
joy  stick  and  foot  pedals. 

b.  This  test  is  being  developed  under  a  contract  with  ARI.  The 
work  is  being  carried  out  at  NAMRL. 
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apparatus  tests.  Thus,  there  Is  much  less  Intra-unit  and  inter-unit  vari¬ 
ability  among  the  computerized  tests  than  there  was  among  the  mechanical 
tests.  This  Is  not  to  say  that  the  computerized  tests  are  perfectly  stan¬ 
dardized.  No  two  circuit  boards  will  ever  be  exactly  alike,  so  there  will 
always  be  some  test  score  variance  attributable  to  individual  differences 
in  apparatus  units.  In  addition,  even  the  most  state-of-the-art  com¬ 
puterized  apparatus  test  still  requires  many  mechanical  parts.  For  ex¬ 
ample,  almost  all  tracking  and  coordination  tests  require  the  subject  to 
use  mechanical  controls  such  as  joy  sticks  or  control  sticks  to  respond. 

To  date,  there  is  very  little  in  the  published  psychological  literature 
which  indicates  how  much  test  score  variance  is  attributable  to  differences 
in  these  mechanical  controls,  but  the  magnitude  of  this  variance  is  likely 
to  be  significant.  As  the  use  of  computerized  psychomotor  tests  increases 
over  the  next  several  years,  it  is  hoped  that  research  on  inter-unit  dif¬ 
ferences  among  computerized  apparatus  tests  will  be  conducted  and  pub¬ 
lished. 

A  second  advantage  of  computerized  apparatus  tests  over  mechanical  ap¬ 
paratus  tests  is  that  computerized  tests  are  much  less  likely  to  break 
down.  This  is  because  computers  have  fewer  moving  parts  than  the  old 
mechanical  apparatus.  As  a  result,  maintenance  costs  and  costs  associated 
with  "down  time"  have  been  reduced  considerably  with  computerized  testing. 

Third,  computerized  testing  results  in  even  greater  standardization  in 
testing  conditions  than  paper-and-pencll  testing.  With  computerized  test¬ 
ing,  instructions  to  subjects  can  be  presented  on  the  video  display  termi¬ 
nal,  a  standard  set  of  practice  trials  can  be  conducted,  and  the  test  can 
then  be  Initiated  by  the  subject.  No  administrator  intervention  is  neces¬ 
sary.  Indeed,  most  new  batteries  of  computerized  tests  automatically 
proceed  from  one  test  to  the  next  without  any  Intervention  by  the  test 
administrator.  Thus,  computerized  testing  helps  eliminate  almost  all  of 
the  variance  in  test  scores  attributable  to  nonstandardized  testing  condi¬ 
tions. 

Finally,  computerized  testing  greatly  reduces  data  capture  and  scoring 
costs.  Computers  can  be  programmed  to  automatically  record  each  response 
made  by  the  subject  and  to  score  these  responses  at  the  conclusion  of  the 
test.  This  procedure  effectively  eliminates  all  data  capture  errors  (e.g., 
data  which  are  keypunched  incorrectly). 

In  sum,  the  advent  of  computerized  psychomotor  testing  has  eliminated 
many  of  the  concerns  associated  with  the  unreliability  and  expense  of 
mechanical  apparatus  testing.  In  fact,  computerized  psychomotor  tests  may 
actually  be  cheaper  to  administer  and  score  than  traditional  paper-and- 
pencil  cognitive  ability  tests.  For  this  reason,  and  also  because  psycho¬ 
motor  apparatus  tests  often  contribute  unique  variance  to  the  prediction  of 
performance  In  many  jobs,  the  utility  of  psychomotor  testing  appears  to  be 
substantial. 


Psychomotor  Testing  and  Group  Differences 

Over  the  past  two  decades,  a  great  deal  of  research  has  been  directed 
toward  examining  group  differences  in  ability  test  scores.  Much  of  this 
research  has  been  motivated  by  concerns  over  test  fairness  and  the  legal 
requirements  of  selection  testing.  (A  summary  of  the  legal  and  related 


141 


psychometric  Issues  in  selection  testing  was  presented  In  the  literature 
review  for  cognitive  abilities.  Readers  Interested  In  differential  valid¬ 
ity,  models  of  test  fairness,  and  major  legal  decisions  related  to  selec¬ 
tion  testing  are  referred  to  this  summary.)  Unfortunately,  to  date,  almost 
all  of  this  research  has  been  conducted  In  the  cognitive-perceptual  and 
physical  ability  domains.  As  a  result,  few  data  are  available  on  group 
differences  In  psychomotor  abilities.  In  the  literature  reviewed  for  this 
report,  only  five  studies  discussed  group  differences  and/or  differential 
validity.  Results  from  these  studies  are  discussed  below. 

The  GATB  and  Group  Differences.  Data  on  group  differences  in  GATB 
psychomotor  aptitude  scores  were  reported  In  a  study  by  Fozard  and  Nuttall 
(1971)  and  In  the  GATB  test  manual  (U.S.  Department  of  Labor,  1970). 

Fozard  and  Nuttall  were  Interested  In  differences  In  GATB  aptitude 
scores  across  age  groups  and  socioeconomic  status  (SES)  groups.  Subjects 
In  their  study  were  1,146  men  participating  in  a  longitudinal  study  of 
aging.  At  the  time  of  testing,  the  men  ranged  from  28  to  83  years  old. 

SES  was  estimated  for  each  subject  using  the  Warner  system  (Warner,  Meeker, 
&  Eels,  1960).  With  the  Warner  system,  SES  estimates  are  based  primarily 
on  the  subject’s  occupation  (or  occupation  at  the  time  of  retirement). 

Even  though  the  subjects  were  participating  In  a  longitudinal  study,  for 
this  particular  study  each  subject  was  tested  only  once  (I.e.,  a  cross- 
sectional  design  was  used).  This  means  that  the  subjects  In  the  25-35  year 
old  age  group  were  not  the  same  subjects  as  those  In  the  46-50  year  old  age 
group.  Consequently,  some  of  the  age  differences  In  psychomotor  test 
scores  Identified  during  this  research  may  be  attributable  to  generational 
differences  rather  than  to  biological  changes  associated  with  aging. 

For  their  analyses,  Fozard  and  Nuttall  performed  two-way  (age  x  SES) 
analyses  of  variance  for  each  of  the  nine  GATB  aptitude  areas.  They  found 
statistically  significant  age  effects  for  all  nine  aptitudes  (p<.001  for 
all  three  psychomotor  aptitudes).  In  addition,  they  found  significant  SES 
effects  for  Finger  Dexterity  (p< . 01 ) ,  Motor  Coordination  (pc. 001),  and  all 
six  cognitive-perceptual  aptitudes  (pc. 001  for  all  six).  Manual  Dexterity 
was  the  only  aptitude  which  failed  to  show  a  significant  effect  for  SES. 
Finally,  Fozard  and  Nuttall  tested  for  age  by  SES  Interactions.  No  sta¬ 
tistically  significant  interactions  were  found  for  any  of  the  nine  apti¬ 
tudes.  Table  34  provides  a  summary  of  the  percentage  of  variance  at¬ 
tributable  to  age  and  SES  for  each  of  the  three  psychomotor  aptitudes. 

Fozard  and  Nuttall  also  examined  the  probable  Impact  of  age  and  SES  on 
selection  practices.  Typically,  GATB  selection  decisions  are  based  on 
Occupational  Aptitude  Patterns  (OAPs).  The  OAP  for  an  occupational  group 
is  based  on  the  two,  three,  or  four  aptitudes  correlated  most  highly  with 
job  or  training  performance  criteria.  Cutoff  scores  are  established  for 
each  of  these  aptitudes.  Applicants  are  deemed  qualified  for  an  occupa¬ 
tional  group  If  their  aptitude  scores  equal  or  exceed  the  cutoffs  for  a/7 
of  the  aptitudes  comprising  the  OAP.  Because  of  the  negative  correlation 
between  age  and  GATB  aptitude  scores,  Fozard  and  Nuttall  found  that  age  was 
strongly  related  to  the  number  of  occupational  groups  for  which  applicants 
qualified.  For  example,  Fozard  and  Nuttall  compared  the  average  aptitude 
profiles  of  the  oldest  and  youngest  groups  of  subjects  In  the  highest  SES 
group.  The  average  profile  of  the  youngest  subjects  qualified  them  on  all 
36  OAPs,  but  the  oldest  subjects  qualified  on  only  J7  OAPs.  Within  the 
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Table  34 


Percentage  of  Variance  Attributable  to  Age 
and  Socioeconomic  Status  for  the 
Three  GATB  Psvchomotor  Scales 


Percentage  of  Variance  Attributable  to: 

Aptitude 

Area 

Aoe 

Socioeconomic 

Status 

Motor 

Coordination 

3% 

5% 

Finger 

Dexterity 

14% 

1% 

Manual 

Dexterity 

11% 

0% 

Note.  The  data  are  from  "General  Aptitude  Test 
Battery  scores  for  men  differing  in  age  and 
socieconomic  status"  by  J.  L.  Fozard  and  R.  L. 
Nuttall,  1971,  Journal  of  Applied  Psychology ,  55, 
p.  375. 
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lowest  SES  group,  the  youngest  group  of  subjects  qualified  on  28  OAPs.  In 
contrast,  the  oldest  subjects  In  this  SES  group  qualified  on  only  nine 
OAPs.  Fozard  and  Nuttall  concluded,  "Clearly,  the  older  worker  In  all  SES 
groups  is  severely  handicapped  when  compared  to  yornger  men  on  the  basis  of 
OAP  cutoff  scores"  (Fozard  &  Nuttall,  1971,  pp.  376-377). 

Sex  differences  on  the  GATB  psychomotor  scales  were  studied  in  a 
sample  of  over  7,000  high  school  students  (U.S.  Department  of  Labor,  1970). 
Significant  differences  were  found  for  two  of  the  psychomotor  scales.  Boys 
scored  .4  standard  deviation  higher  than  girls  on  the  Motor  Coordination 
scale  and  .35  standard  deviation  higher  than  girls  on  the  Finger  Dexterity 
scale.  No  significant  sex  differences  were  found  for  the  Manual  Dexterity 
scale. 

Since  selection  decisions  with  the  GATB  are  typically  based  on  OAPs, 
no  data  are  available  to  compare  the  criterion-related  validity  of  the  GATB 
psychomotor  scales  for  males  and  females.  The  GATB  manual  does,  however, 
report  separate  criterion-related  validities  for  males  and  females  for  11 
different  OAPs,  six  of  which  Include  one  or  more  psychomotor  scales.  For 
these  six  OAPs,  there  is  little  evidence  of  differential  validity  for  males 
and  females.  Based  on  these  scant  data,  it  appears  as  though  the  GATB 
psychomotor  scales  are  equally  valid  for  males  and  females. 

Table  35  summarizes  the  data  on  ethnic  group  differences  in  GATB 
psychomotor  aptitude  scores  (U.S.  Department  of  Labor,  1970). 

The  sample  of  job  applicants  tested  in  local  California  employment 
offices  is  probably  most  representative  of  the  general  population.  For  all 
three  psychomotor  scales,  overall  differences  between  minority  and  nonmin¬ 
ority  applicants  Is  less  than  .1  standard  deviation.  Blacks  scored  lowest 
of  all  groups  on  all  three  psychomotor  scales,  while  Hispanlcs  and  Orien¬ 
tals  scored  higher  than  nonminority  applicants  on  all  three  scales.  Most 
group  differences  were  quite  small,  however. 

The  pattern  of  group  differences  was  similar  In  the  samples  of  welders 
and  draftsmen.  Nonminority  welders  scored  between  .4  and  .6  standard  de¬ 
viation  higher  than  minority  welders  on  the  three  psychomotor  scales.  All 
but  one  of  these  minority  welders  was  black.  In  the  first  sample  of 
draftsmen,  minority  draftsmen- -over  90%  of  whom  were  Hispanic  or  Oriental -- 
scored  approximately  .5  standard  deviation  higher  than  nonminority 
draftsmen  on  all  three  psychomotor  scales.  Hispanics  and  Orientals  also 
scored  higher  than  nonminorities  in  the  second  sample  of  draftsmen.  While 
separate  means  are  not  available  for  American  Indians  and  blacks  in  this 
sample,  the  data  certainly  suggest  that  blacks  scored  lower  than  Hispanics, 
Orientals,  and  nonminorities  on  all  three  scales. 

Unfortunately,  there  are  no  data  available  to  evaluate  whether  the 
psychomotor  scales  are  differentially  valid  for  minorities  and  nonminor¬ 
ities. 

In  summary,  the  group  differences  data  available  for  the  GATB  show 
significant  age  and  ethnic  group  differences  for  all  three  psychomotor 
scales  and  significant  sex  differences  for  Motor  Coordination  and  Finger 
Dexterity.  Since  the  data  on  age  differences  were  cross-sectional  rather 
than  longitudinal,  however,  and  since  no  data  were  available  on  the 
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Ethnic  Croup  Oiffcrencea  in  G»TB  Psychowotor  Aptitude  Scores 
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criterion-related  validity  of  the  6ATB  for  different  age  groups,  it  is 
difficult  to  determine  whether  the  GATB  is  biased  in  favor  of  younger 
subjects.  Similarly,  the  presentation  of  validity  data  for  males  and 
females  and  for  various  ethnic  groups  makes  it  impossible  to  determine 
whether  the  GATB  Is  a  valid  predictor  for  each  of  these  groups  and  whether 
the  observed  group  differences  In  test  scores  are  actually  related  to  job 
and/or  training  performance. 

Group  Differences  and  the  ACB.  During  World  War  II,  the  United  States 
armed  forces  were,  for  the  most  part,  segregated  by  race  and  sex.  Blacks 
trained  and  fought  in  separate  units  from  whites,  and  women  were  not  al¬ 
lowed  to  hold  combat  positions.  Nevertheless,  large  numbers  of  blacks  and 
women  were  needed  and  used  as  pilots  during  the  war.  Black  units  in  the 
AAF  played  a  vital  role  In  combat  In  Europe  and  the  Pacific,  and  women 
assigned  to  the  Women’s  Auxiliary  Service  Pilots  (WASPs)  were  responsible 
for  shuttling  troops,  supplies,  and  equipment  overseas. 

The  Aviation  Psychology  Program  research  team  administered  the  ACB  to 
samples  of  black  student  pilots  and  WASPs  during  World  War  II  in  an  effort 
to  determine  the  validity  of  this  battery  for  special  samples  of  aviation 
students.  Results  for  four  ACB  psychomotor  tests  (Melton,  1947)  are  sum¬ 
marized  In  Tables  36  and  37. 

Table  36  reveals  that  white  student  pilots  scored  higher  than  black 
student  pilots  on  all  four  psychomotor  tests.  Differences  In  mean  scores 
ranged  from  .17  standard  deviation  on  the  Santa  Ana  Finger  Dexterity  Test 
to  .44  standard  deviation  on  the  Two-Hand  Coordination  Test.  On  the  three 
tests  of  multilimb  coordination,  the  mean  black-white  differences  was  .29 
standard  deviation.  These  black-white  differences  are  quite  a  bit  smaller 
than  those  reported  for  cognitive-perceptual  abilities  In  that  literature 
review  and  may  be  of  little  or  no  practical  significance. 

Of  special  Interest  are  the  validities  of  these  psychomotor  tests  in 
the  black  sample.  While  median  validities  for  the  three  multilimb  coordi¬ 
nation  tests  ranged  from  .24  to  .26  across  several  samples  of  white  student 
pilots,  validities  in  the  black  sample  ranged  from  only  .04  to  .14.  (The 
Santa  Ana  Finger  Dexterity  Test  was  not  a  valid  predictor  of  graduation 
from  elementary  pilot  training  In  either  group.)  The  researchers  offered 
"no  explanation  of  the  failure  of  the  test(s)  In  the  case  of  the  negro 
students. ... (N)one  of  the  apparatus  tests  showed  appreciable  validity” 
(Melton,  1947,  p.  142).  This  Is  the  only  example  of  differential  validity 
of  psychomotor  tests  for  blacks  and  whites  uncovered  In  the  literature 
reviewed. 

An  Important  question  Is  whether  this  finding  of  differential  validity 
generalizes  to  other  psychomotor  abilities,  other  jobs,  and  other  samples 
of  blacks  and  whites.  The  student  pilots  who  completed  the  ACB  represented 
a  rather  select  group.  Service  members  who  wished  to  enter  flight  training 
first  had  to  pass  the  AAF  Qualifying  Examination.  This  examination  was 
comprised  of  several  cognitive  ability  tests,  including  verbal  aptitude, 
knowledge  of  current  affairs,  mechanical  aptitude,  mathematics,  practical 
judgment,  and  interpretation  of  data  (Davis,  1947).  Miller  (1947)  reported 
that  those  passing  the  AAF  Qualifying  Examination  represented  the  top  25% 
of  all  high  school  graduates.  Unfortunately,  Davis  did  not  report  any  data 
on  black-white  differences  for  the  examination.  If,  however,  one  assumes  a 
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is  the  median  correlation  across  several  classes  of  students 


Descriptive  Statistics  and  Validities  for  Selected  Psycho.otor  Tests 

in  the  aircrew  Classification  flattery  for  Samples  of  Male  and  fewale  Aviation  Students 


K 

to 


* 

Kl 

o* 

in 

par 

« 

o 

• 

* 

Kl 

INI 

m 

w 

— 

. 

a 

• 

• 

a 

a 

ft. 

P 

x  a 

L. 

... 

Ui 

—  *o 

a 

a 

w 

C  X 

> 

5i  v’ 

k  a  a 

a 

C 

—  3 

a 

a 

o* 

0 

V 

y  y 

0 

o 

Kl 

rvi 

M 

X  L  HI 

O  C 

a 

a 

• 

• 

• 

a 

— 

nay 

—  a 

a 

x 

• 

3  K  C 

—  TJ 

o 

in 

m 

m 

m 

X 

oca 

Q.  3 

ft. 

f  8? 

w 

u 

a 

>»  a 

a 

3 

—  aw 

ft. 

o 

0 

o 

<  —  a 

a  a 

Ol 

o 

in 

o 

• 

a 

w  a 

c 

» 

• 

• 

• 

a 

< 

a 

c 

X 

0 

in 

o 

a  *0  — 

a  — 

c 

*■# 

in 

>* 

oi  a  a 

e  — 

X 

c  w  m 

a  a 

a 

—  a 

—  3 

k. 

C  —  L. 

a 

w 

0 

O 

o 

0 

># 

—  a  o 

a 

o* 

o 

in 

a  s  *- 

o  a 

■ 

W 

• 

a 

a 

• 

u  o 

w 

O 

X 

rvj 

Kl 

m 

w— 

a 

w  u  a 

Ol 

ft- 

in 

in 

in 

in 

o 

w 

L.  C 

*•- 

X 

w  v  c 

o  — 

o  a  a 

—  c 

c 

w 

—  x  ■ 

u  — 

• 

o 

0 

0 

«- 

—  w  w 

a  a 

TJ 

K 

K. 

0 

K 

o 

O.  a 

k. 

a 

y 

D 

• 

• 

a 

a 

a 

ft.  a  *3  w 

y 

a 

• 

"U  a  m 

a 

a 

3 

X 

a  w  m 

w  O 

a 

e 

TJ 

u  h-  a 

a  *— 

w 

Ol 

a 

Kl 

O 

O 

m 

X 

c  a 

a  a 

c 

e 

k. 

vT 

0 

ni 

0 

u 

a  v  w  a 

c 

•V 

Ol 

Ol 

z 

e> 

O 

«. 

>  -D  w 

X 

o 

c 

c 

• 

• 

at 

• 

a 

-0  a  — 

a 

•«B 

a  TJ 

O 

K. 

a 

a  w  *o 

i-  3 

w 

a 

c 

a 

c 

c— 

VI 

a  •— 

a  c 

a 

k. 

a 

a 

4 

■  a  — 

3  a 

w 

w 

e 

a 

c 

X 

o  w  a 

3 

Ol 

i_ 

a 

i-  > 

*  >* 

a 

y 

C 

y 

■ 

a 

>* 

*-  a 

k.  u 

o 

O 

ft. 

01 

C  k 

a  a 

a 

w 

c 

E 

o 

o 

c  a  o 

>  w 

o 

a 

u 

_# 

O  3  a- 

a  c 

a  a 

L. 

a 

o 

3  a 

a 

1- 

tw  TJ 

o» 

X 

w  C  C 

o  ■ 

u 

TJ 

y 

c 

w 

i 

INI 

%# 

ni 

o 

u 

a  a  o 

x  a 

a 

a 

TJ 

a 

a 

■ 

fV 

Kl 

Kl 

a 

>* 

3  •  — 

c 

u 

■ 

a 

a 

a 

• 

• 

e 

a 

VI 

■o  O  ft- 

*  a 

a 

c 

O 

w 

a 

y 

C 

X 

a  3  a 

a 

Ol 

a 

ft- 

a 

a 

k  y 

a  ■ 

> 

*♦- 

C 

w 

— 

c 

x  a  — 

—  o 

a  tj 

— 

a 

a 

o 

a  ft. 

a  k. 

X 

a 

TJ 

■ 

3 

w 

•wm 

a  a  o 

■  •*- 

y 

a 

TJ 

c 

in 

N. 

*■» 

a  x 

■ 

w 

— 

a 

a 

Kl 

O 

0 

0 

n 

3  w  a 

a  *a 

c 

o 

a 

a 

ft. 

•O 

> 

• 

a 

a 

a 

>«- 

X 

x  a 

«- 

3 

01 

3 

X 

0 

0 

•- 

> 

a  w  i— 

1-  W 

•*- 

TJ 

a 

w 

*■# 

m 

0 

0 

< 

w  a 

a 

o 

a 

i- 

X 

a 

c  X 

c 

•— 

TJ 

ft- 

a 

w 

V* 

a  w  • 

• 

a 

Ol 

3 

o 

c 

€ 

*o  m 

Ol  ■ 

»*- 

w 

X 

a 

o 

0 

0 

O 

a 

Sac 

c  •- 

o 

a 

o 

O 

a 

a 

o 

o 

m 

0 

u 

a 

W  k  — 

•«—  — • 

3 

X 

X 

Ol 

3 

# 

• 

e 

a 

a 

o 

u 

a  a  c 

c  a 

C  TJ 

3 

3 

c 

w 

x 

in 

Kl 

o* 

0 

u. 

a  — 

o 

a 

a 

>r 

Kl 

in 

-»# 

a  a  a 

a  a 

u 

a 

a  TJ  X 

«- 

—  a  i- 

k.  w 

w 

Ol 

y 

w 

3 

— 

o 

a  a  w 

w  C 

a 

C 

C 

— 

c 

< 

■ 

a 

•«. 

o 

a 

a 

M 

o 

rsl 

0 

CNJ 

Ol 

a  wo 

w  T3 

> 

X 

? 

TJ 

c 

•y 

0 

0 

** 

>■  c 

*-  —  a 

O  3 

a 

3 

3 

y 

• 

• 

a 

a 

■ 

o 

—  y 

TJ 

w 

w 

a 

X 

o* 

csi 

ft- 

w 

ft.  »  c 

•—  a 

a 

a 

a 

• 

Kl 

0 

X 

c 

o  W  a 

a 

TJ 

a 

a 

a 

w 

•w  k.  > 

■D 

ft. 

*y 

a 

a 

ft. 

k. 

c 

O  T> 

T3  V 

a 

a 

X 

X 

o 

k. 

•1 

X 

a  o.  a 

a  "o 

? 

TJ 

y 

w 

o 

o 

o 

e— 

w  a 

o  3 

a 

o 

n 

N- 

N» 

V) 

«i 

C  C  c 

C  -y 

a 

TJ 

«- 

Q 

• 

• 

• 

a 

a 

c 

a  a 

a  u 

w 

C 

O 

O 

w 

— 

*-» 

a 

•  o> 

>  C 

• 

a 

a 

a 

a 

E 

a  -  a 

T3 

Ol 

u 

a 

a 

a 

•— 

V* 

C 

a  c  x 

a 

c 

a 

t_ 

c. 

w 

ft. 

3 

u 

a  o 

a 

o 

o 

a 

X 

o 

a* 

o 

W 

a 

a  w  >. 

E  a 

c 

■p 

O 

o 

o 

C 

a 

V- 

•• 

a 

> 

a  —  a 

o  a 

C 

a 

a 

a 

k. 

o 

a  a  x 

ft-  c 

a 

a 

C 

a  x 

a 

o 

X  w 

a  — 

ft- 

o 

w 

w 

■ 

a 

>* 

■ 

w 

o 

•p> 

a 

a 

c 

0 

a 

w  ■  a 

C  — 

m 

w 

a 

a 

— 

a 

< 

VO 

n-  o  u 

o  — 

TJ 

ft. 

y 

y* 

— 

*0  ft-  o 

—  a 

a 

o 

a 

TJ 

*•* 

■ 

a 

—  **- 

w 

o 

o 

CL 

C 

C 

u 

a 

•«- 

0 

3 

—  a 

a  a 

c 

O 

a 

a 

a 

E 

ft. 

ft- 

a  i.  x 

c  — 

a 

c 

u 

a 

a 

> 

9 

V 

aa 

>  a 

—  a 

> 

a 

CL  ■ 

E 

o 

a 

4-* 

c 

W 

U 

a  w 

■  e  -o 

a 

X 

VI 

c 

o 

X 

« 

O 

S  «i 

•*. 

a 

s 

a 

a 

a 

a 

y 

• 

o 

a 

ft. 

» 

X 

X 

X 

X 

a 

W 

o 

fl 

• 

a  a 

■ 

a 

w 

W 

w 

W 

a 

a 

w 

• 

c 

V  o 

3 

o 

w 

a 

ft. 

a 

c 

u 

fl 

o 

c  —  c 

c  x 

k. 

TJ 

a 

a 

a 

a 

c 

a 

O 

c 

a 

K 

y 

o  a 

O  ft- 

*•- 

a 

w 

w 

w 

w 

a 

a 

*-» 

•» 

TJ 

— V 

Ol 

K 

o* 

•-  w  c 

X 

C 

C 

C 

c 

to 

w 

o 

TJ 

ft. 

o 

c 

V 

C 

»-  a  •- 

c 

TJ 

a 

a 

a 

a 

a 

O 

e 

«- 

O 

u 

a  —  a 

o  • 

a 

a 

a 

a 

a 

a 

ft. 

o 

o 

o 

w 

fty 

« 

X 

W  Q.  ft- 

—  Ol 

w 

y 

a 

a 

a 

a 

a 

Ol 

x 

o 

u 

c 

X 

V) 

—  1  w 

w  C 

a 

a 

t- 

S- 

u 

u 

a 

c 

o 

u 

o 

• 

K 

a 

ft-  o 

a  — 

C 

a 

a 

a 

a  a 

k. 

>* 

■O 

u 

c 

3 

u  o  o 

3  c 

w 

a 

a 

a 

a 

c 

M 

X 

c 

< 

TJ  — 

c 

u 

k. 

k. 

u 

a— 

a. 

V 

a 

ft- 

a  w  a 

a  a 

•— 

—a 

— 

X 

«» 

a 

• 

x  O  a 

«-  k 

— 

— 

•i 

3 

3 

j 

3  XI- 

CL 

• 

tj 

w 

j  a  n 

»-  C  X 

Ol  W 

a 

< 

p 

H 

X 

X 

ft. 

w 

E 

o 

tj 

C 

O 

3 

3 

• 

o 

O* 

• 

• 

# 

• 

• 

■ 

• 

U 

*- 

X 

VI 

X 

a 

X 

O  TJ 

a 

•ft. 

148 


black-white  difference  of  one  standard  deviation  for  AAF  Qualifying  Exami¬ 
nation  scores  (see  the  summary  of  black-white  cognitive  ability  differences 
in  that  literature  review),  then  only  about  10%  of  the  black  population 
would  have  been  able  to  pass  the  examination. 

One  result  of  this  screening  with  the  AAF  Qualifying  Examination  is 
that  validity  coefficients  for  any  subsequent  screening  tests  could  pos¬ 
sibly  be  severely  attenuated  for  blacks.  This  may  explain  some  of  the 
differences  in  validity  coefficients  for  psychomotor  tests  for  blacks  and 
whites.  Since  correlations  between  multilimb  coordination  and  cognitive 
ability  tests  are  quite  low,  however  (see  Table  32),  there  was  probably  not 
a  great  deal  of  range  restriction  in  multilimb  coordination  test  scores  for 
the  student  pilot  candidates  who  completed  the  ACB  (vis-a-vis  the  general 
population).  Therefore,  range  restriction  cannot  explain  all  of  the  dif¬ 
ferences  in  these  validity  coefficients.  Moreover,  the  proportion  of 
entering  student  pilots  who  actually  graduated  from  pilot  training  was 
approximately  .80  for  whites  and  .67  for  blacks.  Since  correlations  be¬ 
tween  dichotomous  and  continuous  variables  are  attenuated  as  the  split  on 
the  dichotomous  variable  departs  from  .50-. 50,  the  validity  coefficients 
for  whites  are  actually  attenuated  slightly  more  than  those  for  blacks. 

A  comparison  of  mean  ACB  psychomotor  test  scores  for  males  and  females 
is  even  more  difficult  to  interpret  because  of  differences  in  the  sampling 
of  these  two  groups. 

While  it  is  not  completely  clear  from  the  description  provided  by 
Melton  (1947),  It  appears  that  the  women  tested  were  WASPs  who  had  grad¬ 
uated  from  elementary  and  basic  training  and  who  were  preparing  to  enter 
advanced  training.  As  noted  previously,  the  AAF  Qualifying  Examination 
eliminated  all  but  the  top  25%  of  high  school  graduates  from  pilot 
training.  Typically  then,  35-40%  of  those  who  did  pass  the  AAF  Qualifying 
Examination  were  eliminated  prior  to  advanced  training  (Miller,  1947). 

These  eliminations  were  generally  the  result  of  flight  deficiencies  which 
occurred  during  elementary  and  basic  training.  Thus,  the  WASPs  involved  in 
this  research  had  already  soloed  successfully  during  elementary  training 
and  demonstrated  proficiency  on  a  number  of  basic  flight  maneuvers  during 
basic  training.  If  one  assumes,  based  on  validity  results  presented  for 
male  student  pilots  In  Table  9,  that  multilimb  coordination  Is  a  valid 
predictor  of  graduation  from  elementary  flight  training  for  women  (cf. 

Table  36),  then  the  WASPs  tested  for  this  research  probably  scored  higher 
on  tests  of  multilimb  coordination  than  females  In  general.  Therefore,  the 
correlations  between  multi  limb  coordination  and  graduation  from  advanced 
training  for  women  reported  In  Table  37  should  be  somewhat  attenuated  due 
to  range  restriction.  In  addition,  the  mean  test  scores  reported  in  the 
table  are  probably  an  overestimate  of  the  mean  scores  for  women  in  general. 

The  males,  whose  data  are  reported  in  Table  37,  were  tested  prior  to 
elementary  flight  training.  Like  their  female  counterparts,  these  men  were 
screened  with  the  AAF  Qualifying  Examination  prior  to  testing.  Unlike  the 
WASPs,  though,  these  men  had  not  yet  demonstrated  their  flight  aptitude  in 
elementary  or  basic  training.  Since  the  cognitive  screening  tests  used  in 
the  AAF  Qualifying  Examination  are  not  highly  correlated  with  multilimb 
coordination,  there  is  probably  not  a  great  deal  of  range  restriction  with 
respect  to  multilimb  coordination  for  these  men. 
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Given  these  sampling  differences,  one  might  expect  the  following: 

1.  Mean  scores  on  tests  of  multlllmb  coordination  will  be 
slightly  higher  for  females  than  males.  This  Is  because 
WASPs  scoring  low  on  these  tests  would  be  more  likely  to 
attrlte  during  elementary  and  basic  flight  training  than 
WASPs  scoring  high  on  these  tests.  Male  student  pilots,  on 
the  other  hand,  would  not  have  had  an  opportunity  to  attrlte 
during  elementary  or  basic  training  prior  to  testing. 

2.  Multlllmb  coordination  validity  coefficients  for  females 
will  be  attenuated  relative  to  those  for  males.  This  is 
because  the  range  of  scores  for  females  on  multi limb  coordi¬ 
nation  tests  will  be  reduced  due  to  the  elevated  attrition 
rate  during  elementary  and  basic  training  for  WASPs  with 
relatively  low  standing  on  this  ability. 

Table  37  shows  that  neither  of  these  predictions  was  upheld.  First, 
males  scored  .53  standard  deviation  higher  than  females  on  the  Complex 
Coordination  Test  and  1.40  standard  deviations  hlgher'than  females  on  the 
Two-Hand  Coordination  Test.  Females,  however,  scored  1.06  standard  devia¬ 
tions  higher  than  males  on  the  Rudder  Control  Test.  Yet  all  three  of  these 
tests  are  measures  of  multlllmb  coordination.  One  possible  explanation  for 
the  large  female  advantage  on  the  Rudder  Control  Test  Is  that  the  behaviors 
required  on  this  apparatus  test  are  very  similar  to  those  required  In 
flying  a  plane.  The  females  tested  all  had  previous  flight  experience, 
while  none  of  the  males  did.  Melton  (1947)  reported  a  correlation  of  .68 
between  previous  flying  experience  and  scores  on  the  Rudder  Control  Test. 
Thus,  this  experience  may  account  for  the  high  mean  score  for  females  on 
this  test.  Nevertheless,  from  the  data  presented  In  Table  37,  It  Is  ex¬ 
tremely  difficult  to  determine  whether  there  are  sex  differences  In  mean 
level  of  multlllmb  coordination  ability.  Second,  the  validity  coefficients 
for  multlllmb  coordination  are  very  similar  for  females  and  males.  On  the 
average,  the  validities  appear  to  be  just  slightly  higher  for  males. 

Data  In  Table  37  also  Indicate  that  males  scored  .34  standard  devia¬ 
tion  higher  than  females  on  the  Santa  Ana  Finger  Dexterity  Test.  The  test 
was  not  a  valid  predictor  of  training  attrition  for  either  group. 

In  sum,  the  data  on  group  difference-  from  the  AAF  Aviation  Psychology 
Program  pose  more  questions  than  they  answer.  White  males  scored  slightly 
higher  than  black  males  on  three  different  measures  of  multilimb  coordina¬ 
tion.  None  of  the  tests  was  a  valid  predictor  of  attrition  from  elementary 
pilot  training  for  blacks,  while  all  three  tests  were  valid  predictors  in 
white  student  pilot  samples.  Sex  differences  in  scores  on  multi limb  co¬ 
ordination  tests  were  more  pronounced  than  ethnic  group  differences.  Un¬ 
fortunately,  the  direction  of  these  sex  differences  was  Inconsistent. 

Males  scored  considerably  higher  than  females  on  two  multlllmb  coordination 
tests,  but  females  scored  quite  a  bit  higher  than  males  on  a  third  test  of 
this  ability.  Additional  analyses  of  these  data  indicated  no  significant 
differences  In  test  validities  for  males  and  females.  Finally,  on  the 
Santa  Ana  Finger  Dexterity  Test,  white  males  scored  slightly  higher  than 
black  males,  who  In  turn  scored  slightly  higher  than  females.  The  test  was 
not  a  valid  predictor  of  graduation  from  flight  training  for  any  of  the 
groups. 
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CCAAB 1 .  The  CCAAB  was  developed  to  aid  in  the  selection  of  foreign  stu¬ 
dents  receiving  aircrew  training  from  the  U.S.  Air  Force  (Mullins,  Keeth, 

&  Riederlch,  1968).  At  the  time,  many  foreign  students  were  coming  to 
this  country  for  aircrew  training  under  the  auspices  of  the  Military  Assis¬ 
tance  Program  (MAP).  The  goal  of  MAP  was  to  provide  technical  and  pilot 
training  to  members  of  the  armed  forces  from  allied  countries. 

One  problem  with  MAP  was  that  different  countries  used  different  selection 
procedures  when  choosing  students  to  participate  in  the  program.  To  help 
solve  this  problem,  the  Air  Force  decided  "to  develop  a  cross-cultural, 
cross-language  battery  of  tests  which  could  be  used  by  the  individual 
countries,  regardless  of  language  and  cultural  differences,  to  Improve 
their  selection  procedures  and  aid  In  predicting  success  of  those  chosen 
for  pilot  or  technical  training"  (Mullins  et  al.,  p.  1).  The  initial  CCAAB 
was  comprised  of  27  tests,  including  seven  paper-and-pencll  and  three 
apparatus  psychomotor  tests. 

Results  from  the  initial  validity  study  for  pilots  were  quite  prom¬ 
ising.  Subjects  were  120  pilot  trainees  from  a  range  of  foreign  countries. 

Two  of  the  apparatus  tests,  Rudder  Control  and  Complex  Coordination,  corre¬ 
lated  significantly  with  graduation  from  pilot  training. 

In  a  subsequent  follow-up  study,  the  CCAAB  was  administered  to  a  sample  of 
244  South  Vietnamese  pilot  trainees  (Croll  et  al.,  1973).  Because  the 
CCAAB  tests  were  administered  by  Vietnamese  personnel  in  Vietnam,  Croll  et 
al.  decided  that  It  would  be  too  complicated  for  the  administrators  to 
oversee  the  apparatus  tests.  Therefore,  only  the  paper-and-pencll  tests 
were  administered.  Two  of  the  seven  paper-and-pencll  psychomotor  tests 
correlated  more  than  .20  with  graduation  from  pilot  training.  Line  Control 
(a  measure  of  arm-hand  steadiness)  correlated  .24  with  graduation,  while 
Trace  Tapping  II  (a  measure  of  aiming)  correlated  .29  with  graduation. 

In  the  first  of  these  two  CCAAB  studies,  Mullins  et  al .  (1968)  did  not 
score  their  apparatus  tests  In  the  same  measurement  units  which  previous 
researchers  had  used.  Therefore,  It  was  not  possible  to  compare  mean  appa¬ 
ratus  test  scores  for  these  foreign  student  pilots  with  mean  apparatus  test 
scores  for  other  groups.  Nor  was  it  possible  to  compare  the  performance  of 
these  subjects  on  the  paper-and-pencil  psychomotor  tests  with  the  perfor¬ 
mance  of  other  groups  of  subjects.  This  was  because  the  paper-and-pencll 
psychomotor  tests  were  all  new  measures  which  the  researchers  developed 
specifically  for  the  CCAAB. 

It  was  possible,  however,  to  compare  the  validities  Mullins  et  al . 

(1968)  obtained  for  the  apparatus  tests  with  validity  results  which  Melton 
(1947)  reported  from  the  AAF  Aviation  Psychology  Program  during  World 
War  II.  These  results  are  summarized  In  Table  38.  Validities  for  Complex 
Coordination  and  Rudder  Control,  both  of  which  are  measures  of  multilimb 
coordination,  were  approximately  equal  in  the  two  groups.  The  validities 
for  the  control  precision  test  Rotary  Pursuit,  however,  were  quite  dif¬ 
ferent  (median  r-. 23  for  American  student  pilots,  r— .01  for  foreign  stu¬ 
dent  pilots).  While  these  data  suggest  that  Rotary  Pursuit  may  be  a  dif¬ 
ferentially  valid  predictor  of  graduation  from  pilot  training  for  these  two 
groups,  such  conclusions  must  be  tentative  until  additional  validity  data 
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Table  38 


Validity  of  Three  Psvchomotor  Tests  for  Predicting  Graduation 
from  Pilot  Training  for  Foreign  and  American  Student  Pilots 


_ Subjects _ 

Foreign  Student 
Pilots  Training 
with  the 

U.  S.  Air  Force 

In  1968a 

American  Student 
Pilots  Training 
with  the  U.  S. 
Army  Air  Forces 
during  . 

_ World  War  IIb 

Complex  Coordination 

.21 

.26 

Rudder  Control 

.38 

.24 

Rotary  Pursuit 

-.01 

.23 

a.  The  data  In  this  column  are  from  Selection  of  foreign  students 
for  training  In  the  United  States  Air  Force  (AFHRL-TR-68-111) 
(p.  7)  by  C.  0.  Mullins,  J.  B.  Keeth,  and  L.  D.  Riederich, 
1968,  Lackland  Air  Force  Base,  TX:  U.  S.  Air  Force  Human 
Resources  Laboratory,  Personnel  Research  Division.  The 
validity  coefficients  reported  in  this  table  are  mean 
validities  over  five  trials  for  Complex  Coordination,  six 
trials  for  Rudder  Control,  and  three  trials  for  Rotary 
Pursuit.  M-120  for*  all  tests  and  trials.  The  criterion  was 
graduation  from  pilot  training. 

b.  The  data  In  this  column  are  from  Apparatus  tests  (Army  Air 
Forces  Aviation  Psychology  Program  Research  Report  No.  4)  by 
A.  W.  Melton  (Ed.),  1947,  Washington,  DC:  U.  S.  Government 
Printing  Office.  The  validity  coefficients  reported  In  this 
table  are  median  validities  across  several  classes  of  student 
pilots.  iM0,527  for  Complex  Coordination  (18  classes  of 
student  pilots),  ii-10,278  for  Rudder  Control  (six  classes  of 
student  pilots),  and  H-8,955  for  Rotary  Pursuit  (seven  classes 
of  student  pilots).  The  criterion  was  graduation  from 
elementary  pilot  training. 
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for  foreign  student  pilots  are  collected. 


Summary  of  Group  Differences  Data.  Table  39  summarizes  the  data  on 
group  differences  for  each  of  the  nine  major  psychomotor  abilities.  No 
data  are  available  for  four  of  these  abilities:  rate  control,  aiming,  arm- 
hand  steadiness,  and  speed  of  arm  movement.  In  addition,  there  are  no  data 
on  average  group  tests  score  differences  for  control  precision. 

Mean  scores  for  whites  on  tests  of  multilimb  coordination,  finger 
dexterity,  manual  dexterity,  and  wrist-finger  speed  are  typically  higher 
than  those  for  blacks.  Differences  in  mean  scores  range  from  .2  to  .5 
standard  deviation.  These  differences  are  quite  a  bit  smaller  than  the 
average  black-white  difference  of  .one  standard  deviation  for  cognitive 
ability  tests  reported  In  the  previous  chapter.  Additional  data  from  the 
GATB  Indicate  that  Hlspanics  and  Orientals  score  higher  than  whltas  on  all 
three  psychomotor  scales.  Average  differences  range  froff  A  standard  de¬ 
viation  on  the  Finger  Dexterity  scale  to  1.3  standard  deviations  on  the 
Motor  Coordination  ( 1 . e . ,  wrist-finger  speed)  scale.  American  Indians  also 
score  higher  than  whites  on  two  scales,  the  Manual  Dexterity  scale  (.7 
standard  deviation)  and  the  Motor  Coordination  scale  (.3  standard  d(?via* 
tion) . 

Data  on  sex  differences  show  that  males  score  higher  than  fenuftls  on 
tests  of  finger  dexterity  (.35  standard  deviation)  and  wrist-finger  speed 
(.4  standard  deviation).  There  are  no  sex  differences  In  mean  scores  on 
tests  of  manual  dexterity.  Large  sex  differences  have  been  found  on  three 
tests  of  multilimb  coordination.  The  direction  of  these  differences  is 
Inconsistent,  however.  In  research  conducted  during  World  War  II,  males 
scored  an  average  of  one  standard  deviation  higher  than  females  on  two 
tests  of  multilimb  coordination  while  females  scored  Oightly  more  than  one 
standard  deviation  higher  than  males  on  a  third  mult'imb  coordination 
test. 


Very  little  information  Is  available  regarding  differential  valid¬ 
ity  of  psychomotor  tests.  The  available  validity  data  sugge  t  that  tests 
of  multilimb  coordination,  finger  dexterity,  manual  dexter >  ,  and  wrist- 

finger  speed  are  equally  valid  predictors  of  job  and  training  performance 
for  males  and  females.  Two  possible  Instances  of  d1ffervntial  validity 
were  uncovered,  however.  First,  tests  of  multilimb  coordination  had  no 
validity  as  predictors  of  graduation  from  training  for  black  student  pi¬ 
lots.  Ironically,  these  tests  are  the  most  valid  psychomotor  predictors  of 
graduation  from  flight  training  for  white  male  and  female  student  pilots. 
Second,  Rotary  Pursuit  did  not  predict  graduation  from  training  in  a  sample 
of  foreign  student  pilots.  For  white  American  student  pilots,  however,  the 
average  validity  of  ’Ms  control  precision  test  was  .23  across  seven  dif¬ 
ferent  samples. 

The  Use  of  Single-  vs.  Multiple-Construct  Predictors 

One  of  the  major  decisions  facing  any  selection  researcher  is  whether 
to  use  "factor-pure"  single-construct  ability  measures  or  factorially  com¬ 
plex  multiple-construct  work  sample  measures  as  performance  predictors. 

Researchers  who  are  primarily  interested  in  understanding  the  abilities 
Involved  in  performing  a  job  often  prefer  to  use  well-established  tests  and 
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Table  39 


Summary  of  Group  Differences  Data  for  Nine  Psvchomotor  Abilities 


Summary  of  Summary  of 

Psychomotor  Group  Differences  Differential  Validity 

Ability _ in  Average  Test  Scores  Evidence _ 


Multi limb  White  male  student  pi- 

Coordinatlon  lots  scored  slightly 

higher  than  black  male 
student  pilots  on  three 
different  tests,  but 
the  average  difference 
was  only  .29  standard 
deviation.  Male- 
female  differences  were 
Inconsistent.  White 
male  student  pilots 
scored  an  average  of 
.96  standard  deviations 
higher  on  two  tests  of 
this  ability,  but  fe¬ 
males  scored  1.06  stan¬ 
dard  deviations  higher 
on  a  third  test. 


Control  No  data  available. 

Precision 


Rate  No  data  available. 

Control 


Tests  were  equally  val¬ 
id  predictors  of  gradu¬ 
ation  from  pilot  train¬ 
ing  for  white  males, 
white  females,  and  male 
foreign  student  pilots, 
with  validities  ranging 
from  .22  to  .55  across 
various  samples.  Tests 
were  uncorrelated  with 
graduation  from  pilot 
training  for  black 
males,  however,  with 
validities  for  three 
tests  ranging  from  .04 
to  .14. 


Rotary  Pursuit  was  a 
valid  predictor  of 
graduation  from  pilot 
training  for  white 
American  males  (median 
I-.23  across  seven 
classes  of  pilot  train¬ 
ees),  but  had  no  valid¬ 
ity  for  foreign  student 
pilots  (mean  r*-*01 
across  three  trials). 

No  data  available. 


(Continued) 
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Table  39  (Continued) 


Psychomotor 


Summary  of 
Group  Differences 


Summary  of 

Differential  Validity 


Finger  White  male  student  pi-  The  Santa  Ana  Finger 

Dexterity  lots  scored  approxi-  Dexterity  Test  was  not 

mately  .2  standard  de-  a  valid  predictor  of 
viation  higher  than  graduation  from  pilot 

black  male  student  pi-  training  for  blacks, 
lots  on  the  Santa  Ana  whites,  or  women.  Data 
Finger  Dexterity  Test.  from  the  GATB  provide 
In  turn,  black  male  no  evidence  of  differ- 

student  pilots  scored  entlal  validity  of  the 

approximately  .2  stan-  GATB  Finger  Dexterity 
dard  deviation  higher  scale  for  males  and 

than  female  student  females, 

pilots.  Data  for  sev¬ 
eral  samples  from  the 
GATB  Finger  Dexterity 
scale  showed  that  HIs- 
panlcs  and  Orientals 
scored  an  average  of  .4 
standard  deviation 
higher  than  whites, 
while  whites  scored  an 
average  of  .5  standard 
deviations  higher  than 
blacks.  There  was  no 
difference  In  the  ave¬ 
rage  scores  of  whites 
and  American  Indians. 

In  a  sample  of  7,000 
high  school  students, 
boys  averaged  .35  stan¬ 
dard  deviation  higher 
than  girls  on  this  GATB 
scale.  Other  GATB  re¬ 
search  has  demonstrated 
that  older  subjects  and 
subjects  from  lower  SES 
groups  score  lower  on 
this  scale  than  younger 
subjects  and  subjects 
from  higher  SES  groups, 
respectively. 

(Continued) 
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Table  39  (Continued) 


Summary  of  Group  Differences  Data  for  Nine  Psychomotor  Abilities 


Summary  of  Summary  of 

Psychomotor  Group  Differences  Differential  Validity 

Ability _ in  Average  Test  Scores  Evidence _ 


Manual 

Dexterity 


Hlspanics,  Orientals, 
and  American  Indians 
scored  an  average  of  .7 
standard  deviation 
higher  than  whites  on 
the  GATB  Manual  Dexter¬ 
ity  scale.  Whites  in 
turn  scored  an  average 
of  .4  standard  devia¬ 
tion  higher  than 
blacks  on  this  scale. 
Data  from  a  large  sam¬ 
ple  of  high  school  stu¬ 
dents  showed  no  sex 
differences  on  this 
GATB  scale.  Additional 
analyses  with  GATB  data 
revealed  significant 
SES  effects  but  no  sig¬ 
nificant  age  effects. 

(Continued) 


There  was  no  evidence 
of  differential  valid¬ 
ity  of  the  GATB  Manual 
Dexterity  scale  for 
males  and  females.  No 
data  were  available  to 
evaluate  the  differen¬ 
tial  validity  of  this 
scale  for  minorities 
and  nonminorities. 
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Table  39  (Continued) 


Summary  of  Group  Differences  Data  for  Nine  Psvchomotor  Abilities 


Summary  of 

Psychomotor  Group  Differences 

Ability _ in  Average  Test  Scores 


Wrist- 

Finger 

Speed 


Aiming 

Arm-Hand 

Steadiness 


Orientals  averaged  al¬ 
most  1.3  standard  devi¬ 
ations  higher  than 
whites  on  the  GATB  Mot¬ 
or  Coordination  scale, 
while  Hlspanics  and 
American  Indians  ave¬ 
raged  .3  standard  devi¬ 
ations  higher  than 
whites.  Blacks  scored 
an  average  of  .5  stan¬ 
dard  deviations  lower 
than  whites  on  this 
scale.  In  a  large  sam¬ 
ple  of  high  school  stu¬ 
dents,  boys  averaged  .4 
standard  deviations 
higher  than  girls  on 
this  scale.  Additional 
GATB  data  show  that 
scores  on  this  scale 
are  negatively  corre¬ 
lated  with  age  and  pos¬ 
itively  correlated  with 
SES. 

No  data  available. 

No  data  available. 


Speed  of  No  data  available. 

Arm  Movement 


Summary  of 

Differential  Validity 
Evidence _ _ 

There  was  no  evidence 
of  differential  valid¬ 
ity  of  the  GATB  Motor 
Coordination  scale  for 
males  and  females.  No 
data  were  available  to 
evaluate  the  differen¬ 
tial  validity  of  this 
scale  for  minorities 
and  nonminorities. 


No  data  available. 
No  data  available. 

No  data  available. 
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constructs  for  ability  measures.  In  this  manner,  they  can  specify  the 
abilities  underlying  job  performance  with  more  confidence  based  on  their 
knowledge  and  understanding  of  the  test  and  ability  construct  they  have 
elected  to  measure.  In  addition,  they  can  compare  ability  requirements  for 
different  jobs  to  determine  the  abilities  common  to  performance  within  and 
between  jobs  or  job  families. 

Conventional  wisdom,  however,  suggests  that  the  best  way  to  predict 
job  performance  is  to  use  predictors  which  are  similar  in  content  to  the 
job  (i.e.,  predictors  which  are  "samples"  of  performance;  Wernimont  & 
Campbell,  1968).  Researchers  who  favor  this  approach  note  that  performance 
on  most  jobs  requires  a  complex  blend  of  knowledges,  skills,  and  abilities, 
and  that  single-construct  predictors  fail  to  tap  this  complexity. 

In  reality,  there  is  no  answer  to  the  question  of  whether  factor-pure 
single-construct  measures  or  factorially  complex  multiple-construct  mea¬ 
sures  are  better  predictors.  This  is  because  there  is  no  such  thing  as  a 
factor-pure  single-construct  measure.  Previously,  in  the  discussion  of  the 
taxonomy  of  the  psychomotor  domain,  it  was  shown  that  abilities  vary  along 
a  continuum  of  complexity.  Within  the  psychomotor  domain,  for  example,  the 
psychomotor  abilities  identified  by  Fleishman  actually  consist  of  a  number 
of  more  elementary  perceptual  and  motor  abilities.  It  was  also  suggested 
that  the  abilities  comprising  Fleishman’s  psychomotor  taxonomy  could  in 
turn  be  grouped  into  more  complex  abilities,  and  that  ultimately  one  could 
probaoly  identify  a  general  psychomotor  coordination  or  "klutziness"  abili¬ 
ty  which  would  be  analogous  to  general  intelligence  (i.e.,  g)  within  the 
cognitive  domain.  Finally,  it  was  concluded  that  the  appropriate  level  of 
complexity  for  an  ability  taxonomy  for  any  selection  research  is  dependent 
upon  the  complexity  of  the  criteria  one  is  attempting  to  predict.  That  is, 
if  one  were  attempting  to  predict  performance  on  tasks  which  require  a 
number  of  perceptual  and  psychomotor  skills  and  abilities,  the  best  course 
would  be  to  choose  predictors  which  were  similar  to  those  tasks  and  which 
would  thus  tap  those  required  skills  and  abilities.  Conversely,  if  the 
ability  requirements  of  a  task  were  quite  simple,  then  measures  of  more 
basic  processes  would  likely  be  the  best  performance  predictors. 

This  approach  to  the  selection  of  predictors  is  not  unlike  that  de¬ 
scribed  in  the  section  on  content  validity  and  content-oriented  test  devel¬ 
opment.  There  again  It  was  suggested  that  tests  which  are  similar  in 
content  to  the  actual  tasks  performed  on  the  job  (I.e.,  samples  of  perfor¬ 
mance)  would  be  the  most  valid  predictors  of  performance. 

Jobs  which  are  heavily  dependent  upon  psychomotor  (vs.  cognitive) 
abilities  would  seem  to  be  particularly  good  candidates  for  multiple- 
construct  work  sample  selection  tests.  For  example,  many  factory  and 
repair/maintenance  jobs  can  be  described  In  terms  of  a  small  number  of 
psychomotor  tasks.  Researchers  can  Identify  these  tasks  via  job  analysis 
and  then  use  the  tasks  as  the  basis  for  work  sample  selection  tests. 
Unfortunately,  there  are  often  factors  which  make  It  Impractical  or  even 
unwise  to  use  such  work  sample  tests  as  predictors. 

First,  as  research  by  Adams  (1953,  1957),  Fleishman  (1960;  Fleishman  & 
Hempel,  1954b,  1955;  Parker  &  Fleishman,  I960),  Hlnrlchs  (1970),  and  others 
has  demonstrated,  the  psychomotor  abilities  accounting  for  variance  in  task 
performance  change  as  a  task  is  practiced  and  learned.  This  implies  that 
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the  psychomotor  tests  which  are  valid  as  predictors  of  Initial  learning 
rate  may  have  little  or  no  relationship  to  a  subject’s  asymptotic  level  of 
task  performance.  Indeed,  if  the  abilities  involved  in  Initial  task 
learning  are  totally  different  from  the  abilities  involved  in  task  perfor¬ 
mance  after  the  task  has  been  practiced  for  many  trials,  then  work  sample 
tests  based  on  that  task  may  be  completely  Invalid  as  predictors  of  ulti¬ 
mate  task  proficiency. 

Second,  generally  speaking,  It  Is  quite  expensive  to  develop  work 
sample  psychomotor  tests.  This  is  due  in  part  to  the  fact  that  many  of  the 
jobs  requiring  high  levels  of  psychomotor  abilities  Involve  the  use  of 
extremely  expensive  equipment  and  machinery  (e.g.,  pilots,  truck  drivers, 
and  tank  gunners).  If  the  test  setting  is  to  cov'  respond  closely  to  the  job 
setting,  it  may  be  necessary  to  develop  very  cor.  y  job  simulators.  Some 
researchers  have  circumvented  this  problem  by  sacrificing  situational 
and/or  task  fidelity  In  their  tests.  For  example,  the  Rudder  Control 
apparatus  developed  by  the  AAF  research  team  (Melton,  1947;  see  Appendix  A) 
bears  only  a  moderate  resemblance  to  the  cockpit  of  an  airplane.  Since  the 
motor  movements  required  to  operate  the  apparatus  are  very  similar  to  those 
required  to  fly  an  airplane,  the  test  has  good  content  validity  (cf. 
Anastasi,  1976).  Nevertheless,  the  use  of  a  relatively  Inexpensive  fac¬ 
simile  like  the  Rudder  Control  apparatus  makes  the  test  as  much  a  "sign"  as 
a  "sample"  of  job  performance.  While  operation  of  the  Rudder  Control 
apparatus  certainly  requires  a  complex  integration  of  perceptual  and  psy¬ 
chomotor  abilities,  It  does  not  even  begin  to  compare  with  the  perceptual - 
psychomotor  demands  of  flying  an  airplane. 

A  third  factor  which  decreases  the  utility  of  work  sample  tests  Is 
that  most  work  sample  tests  are  of  rather  limited  generality.  As  noted 
above,  the  behaviors  comprising  the  performance  domain  of  any  job  typically 
require  a  complex  blend  of  abilities  which  are  almost  never  duplicated  In 
any  other  job.  This  may  not  present  test  development  problems  if  one  Is 
only  attempting  to  predict  job  proficiency  for  a  handful  of  closely  related 
jobs  which  are  each  comprised  of  just  three  or  four  key  tasks.  For  the 
current  research  effort,  however,  the  Army  Is  attempting  to  predict  job 
proficiency  for  19  different  job  families .  Given  our  situation,  it  makes 
more  sense  to  Identify  a  finite  set  of  personal  constructs  underlying 
performance  on  the  various  jobs  and  to  develop  "signs"  tests  to  measure 
these  constructs  and  to  predict  training  performance  or  job  proficiency. 

Reliance  on  a  construct-oriented  test  development  strategy  does  not 
preclude  consideration  of  content  validity  In  the  development  of  these 
tests.  To  the  extent  that  it  Is  possible  to  introduce  tasks  or  elements  of 
the  job  environment  which  are  common  across  job  families  Into  the  personal 
construct  measures,  this  should  be  done.  According  to  most  experts,  In¬ 
creasing  the  similarity  of  the  tests  and  the  jobs  should  Increase  the 
criterion-related  validity  of  the  tests.  It  should  also  Increase  the  face 
validity  of  the  tests,  which  may  have  a  favorable  Impact  upon  subjects’ 
motivation  to  perform  well  on  the  tests. 

Finally,  It  should  be  noted  that  there  Is  little  evidence  to  support 
the  contention  that  complex  work  sample  tests  are  more  valid  predictors 
than  "single-construct"  measures.  The  validity  data  in  Table  29  show  that 
the  median  validity  of  complex  psychomotor  predictors  across  all  job  types 
and  criteria  was  only  .11  (AM  12  validity  coefficients).  By  comparison, 
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the  median  validity  for  all  psychomotor  abilities  across  all  job  types  and 
criteria  was  .18  (/V- 2,206).  Certainly  the  112  validity  coefficients  for 
complex  psychomotor  predictors  tabulated  in  Table  29  represent  a  less  than 
exhaustive  sample  of  the  validity  evidence  for  work  sample  tests.  Never¬ 
theless,  given  the  high  test  development  costs  for  most  work  sample  tests, 
It  would  seem  that  much  more  evidence  of  the  predictive  validity  of  these 
tests  would  need  to  be  accumulated  before  the  expense  of  developing  these 
measures  can  be  justified. 

In  conclusion,  It  appears  that  there  may  be  nothing  to  gain  from 
Investing  a  substantial  amount  of  effort  in  the  development  of  work  sample 
tests.  Work  sample  tests  tend  to  be  expensive  to  develop  and  there  Is  no 
evidence  that  they  are  more  valid  than  tests  which  are  designed  to  measure 
ability  constructs.  All  this  suggests  that  a  construct-oriented  test 
development  strategy,  based  on  Fleishman's  psychomotor  ability  taxonomy, 
would  be  the  best  strategy  for  this  research  program. 
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SECTION  VI 


SUMMARY  AND  RECOMMENDATIONS 

Summary  of  Ma.ior  Findings 

Several  taxonomies  have  been  suggested  for  describing  the  psychomotor 
ability  domain.  By  far  the  most  popular  taxonomy  is  that  based  on  the 
factor  analytic  research  of  Edwin  Fleishman  and  his  associates  (Fleishman, 
1967,  1972).  This  review  represents  an  effort  to  explicate  and  evaluate 
the  predictive  validity  and  practical  utility  of  nine  of  the  abilities 
Identified  by  Fleishman:  multilimb  coordination,  control  precision,  rate 
control,  finger  dexterity,  manual  dexterity,  wrist-finger  speed,  aiming, 
arm-hand  steadiness,  and  speed  of  arm  movement. 

Fleishman’s  taxonomy  was  selected  for  evaluation  primarily  because  the 
abilities  In  this  taxonomy  seemed  particularly  relevant  to  the  criteria  of 
Interest  In  this  selection  and  classification  research  project.  That  Is, 
the  criteria  In  this  study  represent  rather  complex  behaviors  (e.g.,  Repair 
Mechanical  Systems).  It  did  not  seem  likely  that  extremely  basic 
perceptual -motor  processes  from  an  Information  processing  model  of  motor 
skills  learning  would  be  very  powerful  predictors  of  these  criteria.  On 
the  other  hand,  It  also  seemed  unlikely  that  a  single  broad  psychomotor 
coordination  dimension  could  adequately  capture  and  predict  the  varied 
motor  skills  represented  In  the  different  criteria.  The  ability  taxonomy 
suggested  by  Fleishman  was  therefore  judged  to  be  the  most  appropriate  for 
this  review. 

One  of  the  major  problems  with  Fleishman’s  taxonomy  is  the  lack  of 
construct  explication  for  the  nine  abilities.  Indeed,  this  would  have  been 
a  problem  with  any  of  the  psychomotor  ability  taxonomies  reviewed  for  this 
paper.  Virtually  no  research  has  been  devoted  to  Identifying  and  develop¬ 
ing  a  nomologlcal  net  for  any  psychomotor  construct.  In  one  of  the  few 
scholarly  attempts  to  explicate  the  psychomotor  ability  domain,  Imhoff  and 
Levine  (1981)  suggested  that  the  abilities  which  control  precision  and 
speed  of  arm  movement  are  most  relevant  to  the  types  of  motor  skills  under 
open-loop  control  (Keele,  1968),  while  multilimb  coordination  and  rate 
control  affect  tasks  under  closed-loop  control  (Adams,  1971).  Much  work, 
however,  remains  to  be  done  In  this  area. 

To  evaluate  the  probable  criterion-related  validity  of  these  nine 
abilities,  a  comprehensive  review  of  the  literature  was  conducted.  Over 
2,300  validity  coefficients  were  Identified.  These  coefficients  were  cate¬ 
gorized  by  predictor  construct  (l.e.,  ability),  criterion,  job  type,  and 
research  setting  (l.e.,  military  vs.  nonmilltary).  An  analysis  of  these 
validities  revealed  that  performance  In  several  job  types  was  predictable 
from  psychomotor  abilities.  For  example:  multillmb  coordination  and  con¬ 
trol  precision  demonstrated  validity  as  predictors  for  professional,  tech¬ 
nical,  and  managerial  jobs;  manual  dexterity  and  wrist-finger  speed  demon¬ 
strated  validity  for  clerical  jobs;  multillmb  coordination,  manual  dexter¬ 
ity,  and  wrist-finger  speed  demonstrated  validity  for  protective  service 
jobs;  and  finger  dexterity,  manual  dexterity,  and  wrist-finger  speed  demon¬ 
strated  validity  for  service  jgbs,  mechanical  and  structural  maintenance 
jobs,  and  Industrial  jobs. 
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An  evaluation  of  the  validity  of  psychomotor  tests  used  In  the  mili¬ 
tary  was  not  possible  because  psychomotor  tests  have  not  been  widely  em¬ 
ployed  by  the  Armed  Forces.  Almost  72%  of  the  547  psychomotor  validity 
coefficients  from  military  research  which  were  located  during  the  litera¬ 
ture  review  were  based  on  studies  of  pilots.  No  psychomotor  validity 
coefficients  were  identified  for  many  common  Army  jobs,  including  infantry¬ 
man,  administrative  specialist,  motor  transport  operator,  and  cannon  crew¬ 
man. 


Overall,  however,  the  validity  picture  for  psychomotor  tests  is  quite 
promising.  The  median  of  the  2,373  validity  coefficients  was  .17,  and  the 
median  validity  of  .20  for  job  proficiency  criteria  (AM, 358  validity 
coefficients)  was  actually  greater  than  the  median  validity  of  .15  for 
training  performance  criteria  (A/-603  validity  coefficients). 

Since  there  are  no  data  on  the  standard  deviation  of  job  performance 
in  dollars  for  most  Army  jobs,  and  since  the  incremental  validity  of  psy¬ 
chomotor  tests  (i.e.,  above  and  beyond  the  validity  of  current  Army  selec¬ 
tion  and  classification  procedures)  for  different  Army  jobs  is  unknown,  it 
is  currently  impossible  to  assess  the  utility  of  psychomotor  testing  for 
the  Army.  It  would  appear,  however,  that  psychomotor  tests  would  have 
significant  incremental  validity. 

As  part  of  the  literature  review,  1,066  correlations  between 
cognitive-perceptual  and  psychomotor  tests  were  identified.  These  correla¬ 
tions  were  categorized  by  cognitive-perceptual  ability  and  psychomotor 
ability.  Median  correlations  were  computed  for  each  cognitive-perceptual/ 
psychomotor  ability  combination.  Analysis  of  these  median  correlations 
Indicated  that,  with  the  possible  exception  of  wrist-finger  speed,  there, 
was  only  minimal  overlap  between  the  cognitive-perceptual  and  psychomotor 
abilities.  Interestingly,  two  of  the  cognitive-perceptual  abilities  with 
the  lowest  correlations  with  psychomotor  abilities  were  verbal  comprehen¬ 
sion  and  reasoning.  These  are  also  the  two  cognitive-perceptual  abilities 
which  are  most  highly  correlated  with  general  Intelligence,  g.  Thus,  there 
Is  apparently  very  little  overlap  between  psychomotor  abilities  and  g. 

These  results  suggest  that  any  variance  psychomotor  tests  share  with  cri¬ 
terion  measures  will  be  variance  which  cannot  be  tapped  by  any  cognitive- 
perceptual  ability  test.  Given  the  low  correlations  between  psychomotor 
and  cognitive-perceptual  tests,  any  variance  shared  by  psychomotor  tests 
and  criterion  measures  should  result  In  significant  Increments  In  the 
overall  validity  of  the  test  battery. 

The  stability  of  psychomotor  tests  has  been  a  concern  to  some  selec¬ 
tion  researchers.  For  example,  Navy  researchers  have  found  that  even  the 
most  stable  psychomotor  tests  do  not  attain  differential  stability  until 
subjects  have  had  30  minutes  of  practice  per  day  for  six  days  (Jones  et 
al.,  1981).  It  was  shown,  though,  that  differential  stability  (Bittner, 
1979)  Is  not  the  same  as  test-retest  stability.  When  psychomotor  test 
reliability  coefficients  were  examined,  It  was  found  that  the  median  test- 
retest  reliability  coefficient  for  Intertest  Intervals  ranging  from  Imme¬ 
diate  to  30  days  was  .78.  While  this  may  attenuate  the  validity  coeffi¬ 
cients  for  psychomotor  tests  somewhat,  It  Is  certainly  not  so  serious  a 
problem  as  to  jeopardize  the  use  of  psychomotor  tests  In  selection  and 
classification  research. 
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Other  concerns  with  psychomotor  tests  have  centered  on  the  expense, 
unreliability,  and  transportability  problems  associated  with  psychomotor 
testing.  Several  times  previously,  different  branches  of  the  Armed  Forces 
have  decided  against  using  psychomotor  tests  for  selection  and  classifica¬ 
tion  purposes  because  of  the  unreliability  of  the  apparatus  used  to  ad¬ 
minister  the  tests.  Mechanical  breakdowns  have  occurred  frequently.  In 
addition,  there  have  often  been  considerable  disparities  between  different 
units  of  a  given  apparatus,  resulting  In  significant  differences  In  sub¬ 
jects’  test  scores  as  they  are  tested  on  different  units.  With  the  advent 
of  microcomputer  technology,  most  of  the  problems  associated  with  cost, 
reliability,  and  transportability  have  been  alleviated.  This  removes  a 
major  barrier  to  the  widespread  use  of  psychomotor  tests. 

A  critical  Issue  In  all  selection  testing  today  Is  how  members  of 
different  subgroups  fare  on  the  test.  There  have  been  very  few  studies  of 
the  Impact  of  group  differences  on  psychomotor  test  scores.  Studies  with 
the  GATB  revealed  significant  sex  differences  on  two  of  the  three  psycho¬ 
motor  scales  (U.S.  Department  of  Labor,  1970).  GATB  research  has  also 
revealed  significant  differences  between  mean  psychomotor  test  scores  for 
different  ethnic  groups.  Hlspanicsand  Orientals  tend  to  score  highest  on 
all  three  GATB  scales,  followed  by  American  Indians  and  whites,  with  blacks 
scoring  lowest  (U.S.  Department  of  Labor,  1970).  The  range  between  the 
mean  scores  of  the  highest  and  lowest  scoring  groups  has  averaged  approxi¬ 
mately  one  standard  deviation.  Results  from  AAF  research  conducted  during 
World  War  II  revealed  only  very  small  differences  between  mean  apparatus 
test  scores  for  blacks  and  whites  (Melton,  1947).  Tests  for  differential 
validity  were  not  possible  with  the  GATB  because  single-scale  validity 
coefficients  were  not  reported  for  different  subgroups.  In  the  AAF,  there 
were  no  differences  In  test  validities  for  men  and  women.  In  general, 
however,  the  psychomotor  apparatus  tests  used  In  the  ACB  had  virtually  no 
validity  for  blacks  and  moderate  to  high  validities  for  whites.  This  case 
represents  one  of  the  few  examples  of  differential  validity  reported  in  the 
testing  literature.  It  Is  based  on  only  one  sample  of  blacks,  however,  and 
the  results  are  more  than  40  years  old.  Thus,  Its  Implications  for  psycho¬ 
motor  test  research  today  are  uncertain.  The  results  do  suggest  that 
statistical  tests  for  differential  validity  should  be  conducted  for  any  new 
psychomotor  measures. 

A  Suggested  Priority  for  Psychomotor  Test  Development  Efforts 

As  noted  In  the  Introduction  to  this  report,  the  goal  throughout  this 
literature  review  has  been  to  Identify  a  set  of  psychomotor  abilities  that 
are  likely  to  be  related  to  training  performance  and  job  proficiency  cri¬ 
teria  In  the  Army.  Based  In  part  on  the  recommendations  resulting  from 
this  process,  new  psychomotor  tests  will  be  developed  and  Incorporated  Into 
a  new  experimental  Army  selection  and  classification  battery.  The  utility 
and  validity  of  these  new  tests  will  then  be  evaluated  over  the  next 
several  years. 

Several  criteria  were  considered  In  making  these  recommendations. 
First,  the  validity  of  tests  of  each  ability  in  previous  studies  was  exa¬ 
mined.  Abilities  which  demonstrated  moderate  or  high  validities  across 
several  Army  job  types  were  given  preference.  This  was  largely  a  practical 
consideration.  Because  total  testing  time  will  be  limited,  and  because  all 
new  recruits  will  receive  the  same  test  battery  regardless  of  their  stated 
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occupational  preference,  abilities  were  given  priority  If  they  were  likely 
to  be  useful  In  predicting  success  In  several  Army  job  types. 

A  second  criterion  was  to  ensure  that,  Insofar  as  possible,  valid 
psychomotor  tests  were  available  for  each  job  type.  When  only  one  or  two 
abilities  were  valid  predictors  of  performance  for  certain  job  types,  those 
abilities  were  given  extra  consideration  when  making  final  recommendations. 
Third,  the  difficulty  of  developing  and  administering  tests  was  evaluated 
for  each  ability.  There  was  a  strong  desire  to  avoid  any  tests  requiring 
Individual  administration  and  scoring.  Therefore,  preference  was  given  to 
abilities  which  could  be  assessed  easily  via  paper-and-pencll  or  compu¬ 
terized  measures.  Finally,  the  processes  underlying  each  ability  were 
considered.  For  example,  an  effort  was  made  to  ensure  coverage  of  both 
open-loop  control  and  closed-loop  control  phenomena  In  the  experimental 
test  battery.  Other  motor  or  perceptual  requirements  Inherent  In  measures 
of  each  ability  were  also  considered. 

Based  on  these  criteria.  It  Is  recommended  that  tests  be  developed  ac¬ 
cording  to  the  following  prioritization: 

First  Priority 

1.  Multilimb  coordination 


Second  Priority 

2.  Control  precision 

3.  Manual  dexterity 

4.  Rate  control 

5.  Speed  of  arm  movement 

Third  Priority 

6.  Wrist-finger  speed 

7.  Finger  dexterity 

Efltirtll  Priority 

8.  Arm-hand  steadiness 


Fifth  Priority 

9.  Aiming 

Multi  limb  Coordination.  Multilimb  coordination  was  given  the  highest 
priority  for  several  reasons.  First,  tests  of  multilimb  coo.  Mnation’have 
demonstrated  high  validity  as  predictors  of  training  performance  for  both 
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protective  service  jobs  and  professional,  technical,  and  managerial  jobs. 
Second,  multlllmb  coordination  represents  one  of  the  psychomotor  abilities 
related  to  performance  of  tasks  under  closed-loop  control  (Imhoff  &  Levine, 
1981).  Tests  of  multlllmb  coordination  typically  require  subjects  to 
monitor  their  performance  closely  and  make  continuous  motor  adjustments. 
Moreover,  because  tests  of  multlllmb  coordination  require  careful  coordina¬ 
tion  of  two  limbs,  multlllmb  coordination  closely  approximates  what  many 
people  have  in  mind  when  they  speak  of  a  general  psychomotor  "klutziness" 
dimension.  Thus,  this  ability  Is  of  very  great  theoretical  interest. 
Finally,  multilimb  coordination  lends  itself  readily  to  computerized  as¬ 
sessment. 

By  definition,  tests  of  multlllmb  coordination  require  the  subject  to 
use  two  or  more  limbs  simultaneously.  Not  all  tests  which  require  the  use 
of  two  limbs  are  measures  of  multlllmb  coordination,  however.  For  example, 
the  Complex  Coordination  Tests  developed  by  Sanders  et  al.  (1971)  and  Myers 
et  al.  (1982)  require  the  subject  to  perform  separate  tracking  tasks  with 
the  right  and  left  hands.  By  contrast,  many  of  the  apparatus  tests  used 
during  World  War  II  (e.g.,  Two-Hand  Coordination  and  Rudder  Control)  re¬ 
quire  subjects  to  use  their  hands  and/or  their  hands  and  feet  to  accomplish 
but  one  task.  The  emphasis  in  the  latter  is  on  the  careful  coordination  of 
two  or  more  limbs,  while  the  former  emphasizes  the  performance  of  two  tasks 
simultaneously.  The  former  actually  assesses  time-sharing  ability  as  much 
as  multlllmb  coordination.  While  both  types  of  tests  are  Interesting,  it 
Is  recommended  that  at  least  one  "pure"  measure  of  multlllmb  coordination 
be  included  in  the  experimental  test  battery. 

In  his  factor  analytic  studies,  Fleishman  found  that  tests  requiring 
the  use  of  two  hands  and  tests  requiring  the  use  of  one  hand  and  one  foot 
both  loaded  on  the  same  multlllmb  coordination  factor.  Thus,  If  only  one 
multlllmb  coordination  test  Is  Included  In  the  experimental  test  battery, 

It  should  not  matter  which  two  limbs  the  subject  Is  asked  to  use.  Never¬ 
theless,  If  It  Is  possible  to  Include  two  multlllmb  coordination  tests,  it 
would  be  best  if  one  required  the  use  of  two  hands  and  the  other  required 
the  use  of  one  hand  and  one  foot. 

Control  Precision.  Tests  of  control  precision  have  demonstrated  val¬ 
idity  for  predicting  performance  in  protective  service  jobs  and  profes¬ 
sional,  technical,  and  managerial  jobs,  though  the  median  validity  for 
protective  service  jobs  has  only  been  moderate  (median  r*.14,  /V« 7  validity 
coefficients).  Tests  of  multlllmb  coordination,  however,  have  demonstrated 
even  higher  validity  for  predicting  performance  in  these  two  job  types. 
Therefore,  the  validity  of  control  precision  tests  was  not  the  chief  reason 
for  recommending  that  these  measures  be  included  in  the  experimental  test 
battery.  Instead,  control  precision  is  interesting  primarily  because  it 
appears  to  be  related  to  performance  of  tasks  controlled  by  motor  programs 
(Imhoff  &  Levine,  1981).  Thus,  including  control  precision  tests  in  the 
experimental  test  battery  will  ensure  coverage  of  both  open-loop  control 
and  closed-loop  control  processes.  Like  multlllmb  coordination,  control 
precision  lends  itself  readily  to  computerized  testing. 

The  most  popular  test  of  control  precision  has  been  the  Rotary  Pur¬ 
suit.  For  the  Rotary  Pursuit,  the  subject  must  track  an  object  moving  in  a 
known  direction  at  a  constant  rate  of  speed.  Thus,  one  type  of  compu¬ 
terized  control  precision  measure  could  involve  a  simple  pursuit  tracking 
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task  in  which  the  target  moves  in  a  known  direction  at  a  constant  rate  of 
speed.  Another  control  precision  test  which  has  received  some  attention  is 
Controls  Adjustment.  This  test  involves  another  type  of  simple  "tracking" 
task.  On  each  test  trial,  a  stationary  target  stimulus  is  presented  to  the 
subject,  who  must  manipulate  a  control  stick  as  quickly  as  possible  co  move 
a  pursuit  stimulus  so  that  its  position  matches  that  of  the  target  stim¬ 
ulus.  A  computerized  version  of  this  test  would  serve  as  an  excellent 
supplement  to  the  more  traditional  control  precision  pursuit  tracking  task. 

Manual  Dexterity.  Tests  of  manual  dexterity  have  proven  valid  as  pre¬ 
dictors  of  performance  for  several  job  types,  including  clerical  (median 
r«.15,  N* 49  validity  coefficients),  protective  service  (median  r«.15,  N* 5 
validity  coefficients),  service  (median  r-.20,  N* 50  validity  coefficients), 
and  mechanical  and  structural  maintenance  (median  r-.17,  N* 63  validity 
coefficients).  Thus,  manual  dexterity  meets  the  most  Important  criterion 
for  Inclusion  In  the  experimental  battery  in  that  it  has  demonstrated 
validity  for  a  number  of  different  job  types.  Since  little  or  no  validity 
evidence  was  available  for  multilimb  coordination  and  control  precision  for 
three  of  these  job  types,  tests  of  manual  dexterity  would  help  ensure  that 
valid  psychomotor  predictors  were  available  for  each  Army  job  type.  Manual 
dexterity  also  holds  some  theoretical  Interest  because  It  probably  ranks 
near  multilimb  coordination  as  an  Index  of  general  "klutziness." 

The  major  problem  with  existing  manual  dexterity  tests  is  that  they 
all  require  individual  administration;  no  one  has  yet  developed  a  manual 
dexterity  test  which  can  be  administered  in  a  group  setting  or  via  com¬ 
puter.  The  ever  Increasing  number  of  peripheral  devices  available  for 
microcomputers,  Including  light  pens,  bit  packs,  and  graphics  tablets,  may 
facilitate  a  creative  solution  to  this  problem. 

Failing  that,  the  best  alternative  would  be  to  develop  a  test  that 
combines  elements  of  common  finger  and  manual  dexterity  tests.  Such  a  test 
might  require  subjects  to  perform  some  type  of  assembly  and/or  disassembly 
operation  with  blocks,  similar  to  the  assembly  and  disassembly  tasks  com¬ 
prising  the  GATB  Finger  Dexterity  Test.  The  only  difference  between  this 
new  test  and  the  GATB  Finger  Dexterity  Test  would  be  that  the  blocks 
Involved  in  the  new  test  would  be  larger  than  the  pegs  involved  in  the  GATB 
test.  Thus,  the  subject  would  be  required  to  make  both  fine,  highly  con¬ 
trolled  manipulations  and  movements  with  the  fingers  (l.e.,  during  assembly 
and  disassembly)  and  more  gross  manipulations  and  movements  with  the  hand 
(l.e.,  during  lifting,  turning,  and  placing).  Since  such  a  test  would 
combine  elements  of  manual  and  finger  dexterity,  it  would  alleviate  the 
need  to  assess  finger  dexterity  separately. 

Rate  Control .  Rate  control,  like  multilimb  coordination.  Is  involved 
in  performance  of  tasks  under  closed-loop  control.  Rate  control  is  prob¬ 
ably  a  purer--or  at  least  somewhat  different--measure  of  closed-loop  con¬ 
trol  processes  than  multilimb  coordination  Is,  since  the  motor  response  re¬ 
quired  In  rate  control  tests  Is  somewhat  simpler  than  that  required  in 
multilimb  coordination  tests.  This  is  because  rate  control  tests  do  not 
require  the  subject  to  carefully  coordinate  the  movements  of  two  or  more 
limbs.  This  difference  between  multilimb  coordination  and  rate  control 
makes  rate  control  a  theoretically  Interesting  construct. 

Unfortunately,  there  have  been  few  investigations  of  the  validity  of 
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rate  control  tests.  Moreover,  the  few  validities  which  have  been  obtained 
have  not  been  outstanding  (median  r».06  across  all  job  types  and  criteria, 
16  validity  coefficients).  Thus,  It  Is  not  readily  apparent  whether  rate 
control  would  contribute  to  the  predictive  validity  of  the  experimental 
test  battery. 

Nevertheless,  a  lack  of  validity  research  on  rate  control  measures  and 
a  theoretical  Interest  In  this  construct  due  to  its  association  with 
closed-loop  control  processes  suggest  that  rate  control  deserves  additional 
attention  and  research.  Therefore,  It  Is  recommended  that  rate  control 
measures  be  Included  In  the  experimental  test  battery. 

Rate  control  Is  usually  assessed  via  tracking  tests.  The  distinguish¬ 
ing  feature  of  these  tests  Is  that  the  target  stimulus  is  making  continuous 
and  unpredictable  changes  In  speed  and  direction. 

Two  different  types  of  tracking  tests  are  commonly  used.  In  a  compen¬ 
satory  tracking  test,  subjects  must  keep  a  moving  stimulus  as  close  as 
possible  to  a  fixed  null  point.  In  a  pursuit  tracking  test,  subjects 
control  a  pursuit  stimulus  which  they  attempt  to  keep  centered  about  a 
target  stimulus.  It  would  be  Interesting  to  examine  the  validity  of  rate 
control  tests  Involving  both  types  of  tracking  tasks.  If  there  Is  or.ly 
time  to  administer  one  rate  control  test,  however,  the  best  procedure  would 
be  to  ensure  that  both  pursuit  and  compensatory  tracking  tests  are  Included 
In  the  experimental  test  battery.  Thus,  the  choice  of  a  rate  control  test 
might  depend  on  the  types  of  tracking  tests  used  In  the  measures  of  multi  - 
limb  coordination  and  control  precision. 

If  time  permits,  a  novel  type  of  compensatory  tracking  test  might  also 
be  Included  In  the  experimental  test  battery.  In  this  test,  a  digital 
display  Instead  of  a  moving  target  would  be  used  as  a  stimulus.  The 
subject's  task  would  be  to  use  a  dial  or  sliding  resistor  to  keep  the 
display  number  as  close  to  zero  as  possible.  Such  a  procedure  would  appear 
to  reduce  the  spatial  requirements  of  the  test.  It  would  therefore  be 
Interesting  to  compare  the  validity  and  cognitive-perceptual  correlates  of 
such  a  test  to  the  validity  and  cognitive-perceptual  correlates  of  more 
traditional  compensatory  tracking  tests. 

Speed  of  Arm  Movement.  Two  of  the  nine  psychomotor  abilities  In  our 
taxonomy  were  Included  by  Imhoff  and  Levine  (1981)  In  their  basic  movement 
speed  and  accuracy  dimension.  These  abilities  were  control  precision  and 
speed  of  arm  movement.  According  to  Imhoff  and  Levine,  both  of  these 
abilities  are  related  to  motor  behavior  controlled  by  open-loop  processes. 
Yet,  there  Is  an  Important  difference  between  control  precision  and  speed 
of  arm  movement.  In  tests  of  control  precision  (e.g.,  Rotary  Pursuit),  the 
subject  may  be  required  to  make  very  accurate  movements.  Scores  on  control 
precision  tests  are  often  error  or  accuracy  measures  (e.g.,  root-mean- 
square  error,  time  on  target).  On  the  other  hand,  in  tests  of  speed  of  arm 
movement  (e.g.,  Two-Plate  Tapping),  the  need  for  accurate  movement  is 
minimized.  Targets  In  speed  of  arm  movement  tests  are  typically  very 
large,  so  that  virtually  any  movement  In  the  direction  of  the  target  is 
likely  to  make  contact  with  the  target.  The  total  emphasis  on  speed  rather 
than  accuracy  makes  speed  of  arm  movement  unique  In  the  psychomotor  ability 
domain.  That  is  the  chief  reason  for  recommending  that  speed  of  arm  move¬ 
ment  be  Included  In  the  experimental  test  battery. 
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Unfortunately,  as  Table  29  shows,  there  are  almost  no  validity  data 
for  speed  of  arm  movement.  This  makes  It  difficult  to  argue  that  a  speed 
of  arm  movement  test  would  contribute  significantly  to  the  prediction  of 
performance  In  cne  or  more  job  types.  It  also  raises  questions  about  the 
utility  of  developing  a  test  specifically  to  assess  this  ability. 

There  Is  a  way  to  keep  any  test  development  costs  minimal,  however. 

One  measure  considered  for  Inclusion  In  the  experimental  computer  battery 
Is  reaction  time.  In  his  recent  research,  Jensen  has  used  a  test  apparatus 
which  permits  the  assessment  of  both  reaction  time  and  movement  time  on 
each  trial  (e.g.,  Jensen,  1982).  Subjects  begin  a  trial  with  their  Index 
finger  on  a  home  button.  When  the  stimulus  Is  presented,  subjects  move 
their  finger  from  the  home  button  to  a  large  target  button.  The  time  from 
the  presentation  of  the  stimulus  until  the  release  of  the  home  button  Is 
the  subject's  reaction  time.  The  time  from  the  release  of  the  home  button 
until  the  target  button  Is  pressed  Is  the  movement  time.  Movement  time  Is, 
of  course,  a  measure  of  speed  of  arm  movement. 

In  sum,  even  though  there  Is  no  validity  evidence  for  speed  of  arm 
movement,  It  Is  recommended  that  a  measure  of  this  ability  be  Included  in 
the  experimental  test  battery.  This  Is  because  speed  of  arm  movement 
represents  perhaps  the  only  psychomotor  ability  where  movement  accuracy  is 
not  a  factor.  Moreover,  speed  of  arm  movement  can  be  measured  at  the  same 
time  as  reaction  time.  Thus,  Including  a  speed  of  arm  movement  measure  in 
the  experimental  test  battery  will  not  Increase  total  testing  time  for 
subjects  and  will  result  in  only  very  minimal  Increases  In  test  development 
costs. 

Wrist-Finger  Speed.  Table  29  shows  that  wrist-finger  speed  has  been  a 
valid  predictor  of  training  performance  and  job  proficiency  for  several  job 
types,  Including  clerical,  protective  service,  and  Industrial.  Neverthe¬ 
less,  there  are  at  least  three  reasons  for  omitting  tests  of  wrist-finger 
speed  from  the  experimental  test  battery. 

First,  manual  dexterity  Is  also  a  valid  predictor  of  performance  for 
clerical,  protective  service,  and  Industrial  jobs.  Thus,  wrist-finger 
speed  would  not  add  to  the  number  of  job  types  for  which  valid  psychomotor 
predictors  are  available  In  the  experimental  test  battery. 

Second,  wrist-finger  speed  is  not  of  compelling  theoretical  Interest. 
Like  control  precision  and  speed  of  arm  movement,  wrist-finger  speed  is 
most  closely  related  to  movement  controlled  by  open-loop  processes.  Thus, 
wrist-finger  speed  falls  within  Imhoff  and  Levine’s  (1981)  basic  movement 
speed  and  accuracy  dimension.  Within  this  dimension,  speed  of  arm  movement 
requires  great  speed  and  little  accuracy  and  control  precision  requires 
approximately  equal  amounts  of  both  speed  and  accuracy.  Wrist-finger  speed 
falls  somewhere  between  the  two,  requiring  somewhat  more  speed  than  ac¬ 
curacy.  Wrist-finger  speed  would  therefore  probably  not  add  greatly  to  our 
understanding  of  this  dimension.  Moreover,  the  movements  involved  in 
wrist-finger  speed  are  quite  simple.  The  movements  do  not  require  a  great 
deal  of  precision,  coordination,  or  timing. 

Third,  most  tests  of  wrist-finger  speed  in  current  use  must  be  scored 
by  hand.  While  It  might  be  possible  to  adapt  wrist-finger  speed  tests  to 


168 


the  computer  using  a  light  pen,  the  technology  Is  quite  new  and  may  not  be 
sufficiently  reliable  to  permit  accurate  assessment. 

Because  of  these  three  problems  and  because  total  testing  time  is 
limited,  it  Is  recommended  that  no  tests  of  wrist-finger  speed  be  included 
in  the  experimental  test  battery. 

Finger  Dexterity.  Table  29  shows  that  finger  dexterity  has  also  been 
a  valid  predictor  of  performance  for  several  job  types.  Nevertheless, 
finger  dexterity  tests  would  contribute  little  to  the  experimental  test 
battery.  For  example,  finger  dexterity  tests  would  not  add  to  the  number 
of  job  types  for  which  valid  psychomotor  predictors  were  available.  In 
addition,  finger  dexterity  tests  would  create  even  more  serious  test  admin¬ 
istration  problems  than  wrist-finger  speed  tests  because  finger  dexterity 
tests  would  have  to  be  Individually  administered  as  well  as  hand  scored. 
Thus,  It  Is  recommended  that  no  finger  dexterity  tests  be  Included  In  the 
experimental  test  battery. 

Previously,  It  was  noted  that  many  finger  dexterity  tests  require 
subjects  to  assemble  or  disassemble  objects.  It  was  suggested  that  as¬ 
sembly  and  disassembly  tests  could  be  incorporated  into  a  manual  dexterity 
test.  In  this  manner,  some  of  the  Individual  differences  associated  with 
finger  dexterity  might  be  captured  In  the  manual  dexterity  test.  This 
would  alleviate  the  need  to  administer  separate  tests  of  these  two  abil¬ 
ities.  Given  the  expenses  associated  with  the  administration  of  dexterity 
tests,  this  appears  to  be  the  only  practical  solution  to  the  finger  dex¬ 
terity  testing  problem. 

Arm-Hand  Steadiness.  While  the  focus  In  measures  of  speed  of  arm 
movement  Is  on  movement  speed,  the  focus  In  measures  of  arm-hand  steadiness 
Is  on  movement  accuracy.  In  one  of  the  two  most  common  types  of  arm-hand 
steadiness  tests,  a  subject’s  task  Is  to  trace  through  a  narrow  maze  with  a 
stylus  without  touching  the  sides  of  the  maze.  Arm-hand  steadiness  tests 
are  generally  not  timed.  Subjects  can  attempt  to  minimize  error  scores 
without  worrying  about  time  constraints.  For  this  reason,  arm-hand  steadi¬ 
ness  represents  a  rather  Interesting  ability  construct. 

In  the  past,  the  predictive  validity  of  arm-hand  steadiness  tests  has 
received  only  limited  attention.  The  results  have  generally  been  disap¬ 
pointing.  Table  29  shows  that  validity  data  are  available  for  just  two  job 
types,  and  median  validities,  are  less  than  .10  for  both.  Airplane  pilot  Is 
perhaps  the  only  job  for  which  the  validity  of  arm-hand  steadiness  has  been 
Investigated  extensively.  Since  "smoothness  of  control  movements"  was 
Identified  as  one  of  the  key  reasons  for  success  In  pilot  training  during 
World  War  II  (see  Table  4),  it  would  seem  reasonable  that  arm-hand  steadi¬ 
ness  tests  should  have  at  least  moderate  validity  as  predictors  of  training 
performance  for  pilots.  Table  9  shows  that  this  is  not  the  case;  the 
median  validity  of  arm-hand  steadiness  tests  for  predicting  attrition  from 
pilot  training  was  only  .10  (tf-16  validity  coefficients).  These  results 
suggest  that  arm-hand  steadiness  would  not  contribute  to  the  overall  va¬ 
lidity  of  the  experimental  test  battery. 

Another  problem  with  arm-hand  steadiness  tests  Is  that  they  would  be 
expensive  to  administer.  All  of  the  arm-hand  steadiness  tests  Identified 
In  the  literature  reviewed  for  this  report  were  apparatus  tests  requiring 
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Individual  administration.  Although  It  might  be  possible  to  develop  a  com¬ 
puterized  arm-hand  steadiness  test,  such  a  test  would  probably  require  some 
expensive  hardware  peripherals  (e.g.,  a  graphics  tablet).  The  validity 
evidence  suggests  that  this  would  not  be  a  cost-effective  investment  In 
hardware . 

Primarily  because  past  research  suggests  that  the  validity  of  arm-hand 
steadiness  tests  would  be  negligible,  It  is  recommended  that  no  tests  of 
arm-hand  steadiness  be  Included  In  the  experimental  test  battery. 

Aiming.  Aiming  Is  another  psychomotor  ability  which  has  received  very 
little  attention  in  validity  research.  The  very  name  of  the  ability, 
"aiming,"  is  Intriguing,  though.  The  name  suggests  that  this  ability  might 
be  relevant  to  performance  in  a  wide  variety  of  Army  combat  MOS. 

Closer  Inspection  of  the  tests  used  to  measure  this  ability,  however, 
indicates  that  the  ability  may  be  m1snamed--or,  at  the  very  least,  that  the 
name  might  be  easily  misinterpreted.  A  subject’s  task  in  an  aiming  test  is 
to  place  a  pencil  dot  Inside  a  small  circle.  In  some  tests,  the  circles 
are  numbered  (e.g.,  from  one  to  100),  and  the  subject  must  dot  the  circles 
In  order.  Regardless  of  the  specific  format  and  instructions  of  the  tests, 
the  subject’s  basic  task  does  not  change.  The  subject  must  still  place 
pencil  dots  In  small  circles. 

In  an  attempt  to  describe  the  ability  underlying  performance  on  these 
tests,  Fleishman  settled  upon  the  name  "aiming."  Often  he  expanded  on  this 
name,  referring  to  the  ability  as  "aiming  (eye-hand  coordination)"  In  many 
of  his  factor  analytic  studies.  Yet,  there  Is  no  construct  validity  evi¬ 
dence  to  relate  the  ability  to  performance  on  any  tasks  requiring  either 
aiming  (e.g.,  aiming  a  gun)  or  eye-hand  coordination  (e.g.,  hitting  a 
baseball  or  catching  a  football).  Nor  does  there  seem  to  be  any  reason  to 
expect  that  the  ability  would  generalize  to  any  task  other  than  aiming  a 
pencil  at  a  small  circle.  Indeed,  that  Is  probably  a  more  appropriate  name 
for  this  ab111ty--"aiming  a  pencil  at  a  small  circle." 

Because  of  the  lack  of  either  predictive  or  construct  validity  evi¬ 
dence,  It  Is  recommended  that  no  measure  of  aiming  be  Included  In  the 
experimental  test  battery. 

Job  Content  and  the  Selection  of  Psvchomotor  Tests 

As  noted  previously,  the  recommendations  above  are  based  primarily  on 
the  validity  evidence  presented  In  Tables  8-29.  The  ease  and  expense  of 
administering  and  scoring  measures  of  each  ability  were  also  considered, 
though  these  considerations  were  accorded  somewhat  less  weight  In  deriving 
recommendations.  The  final  choice  of  psychomotor  abilities  for  the  experi¬ 
mental  test  battery  will  require  consideration  of  one  additional  source  of 
information:  the  content  of  Army  jobs. 

There  are  several  potential  sources  for  this  job  content  information. 
For  example,  as  part  of  this  selection  and  classification  research  project, 
training  and  job  performance  dimensions  are  being  identified  for  a  number 
of  key  Army  MOS.  Preliminary  descriptions  of  many  of  these  dimensions  are 
currently  available.  In  addition,  there  have  been  several  opportunities 
for  members  of  the  test  development  research  team  to  conduct  job  observa- 
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tlons  and  to  examine  training  simulators.  Since  these  job  observations 
have  focused  on  MOS  which  require  subjects  to  perform  many  skilled  percep¬ 
tual  and  motor  tasks,  this  job  content  Information  should  be  especially 
relevant  to  our  efforts  to  Identify  a  final  set  of  psychomotor  abilities 
for  the  experimental  test  battery. 

Job  content  Information  gathered  from  these  sources  might  suggest  that 
wrist-finger  speed,  finger  dexterity,  arm-hand  steadiness,  and/or  aiming 
are  vital  to  the  performance  of  many  Army  jobs  and  should  be  measured  in 
the  experimental  test  battery.  Conversely,  the  job  content  Information 
might  reveal  that  multilimb  coordination,  control  precision,  manual  dex¬ 
terity,  rate  control,  and/or  speed  of  arm  movement  are  totally  Irrelevant 
to  performance  In  the  Army  and  need  not  be  Included  In  the  experimental 
test  battery.  These  Issues,  although  beyond  the  scope  of  this  report,  must 
be  considered  to  ensure  that  the  appropriate  psychomotor  abilities  are 
represented  In  the  experimental  test  battery. 
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APPENDIX  A 

Descriptions  of  Selected  Psychomotor  Tests 


A-0 


Table  of  Contents 


Psvchomotor  Test _ Eage 

Arm-Hand  Steadiness  Test  A-2 

Complex  Coordination  Test  (Melton,  1947}  A-4 

Complex  Coordination  Test  (Sanders  et  al.,  1971)  A-8 

GATB  Finger  Dexterity  Scale  A- 10 

GATB  Manual  Dexterity  Scale  A- 13 

GATB  Motor  Coordination  Scale  A- 16 

Large  Tapping  Test  A- 18 

Minnesota  Rate  of  Manipulation  Test  A- 19 

Motor  Judgment  Test  A-21 

O’Connor  Tweezer  and  Finger  Dexterity  Test  A-23 

Purdue  Pegboard  A-25 

Rate  Control  A- 27 

Rotary  Pursuit  Test  A-29 

Rudder  Control  Test  A-32 

Santa  Ana  Finger  Dexterity  Test  A-35 

Single-Dimension  Pursuitmeter  A-38 

Small  Tapping  Test  A-40 

Steadiness  Aiming  Test  A-41 

Trace  Tapping  II  A-43 

Two-Hand  Coordination  Test  (Melton,  1947)  A-44 

Two-Hand  Coordination  Test  (Sanders  et  al.,  1971)  A-48 

Two-Hand  Pursuit  Test  A-50 

Two-Plate  Tapping  Test  A-53 

A-l 


TEST  TITLE: 


Arm-Hand  Steadiness  Test 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 


DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 


The  steadiness  of  the  arm  and  hand  when  the  arm  is 
extended  at  full  length  (Arm-Hand  Steadiness) 


The  test  apparatus  consists  of  a  metal  rod  or  stylus 
which  extends  through  a  hole  or  aperture  in  a  metal 
plate  (see  the  figure  below).  .  The  subject’s  task  is  to 
hold  the  stylus  within  the  aperture  without  allowing  the 
stylus  to  touch  the  sides  or  edge  of  the  aperture. 


The  test  consists  of  eight  30-second  trials  separated  by 
15-second  rest  periods.  For  the  first  trial,  the 
subject  uses  his  preferred  hand.  For  the  next  six 
trials,  the  subject  completes  two  trials  with  the 
nonpreferred  hand  followed  by  two  trials  with  the 
preferred  hand  followed  by  two  more  trials  with  the 
nonpreferred  hand.  Finally,  for  the  eighth  trial,  the 
subject  again  uses  his  preferred  hand.  The  subject’s 
score  is  either  the  number  of  contacts  made  with  the 
sides  or  edge  of  the  aperture  or  the  total  time  in 
contact  with  the  sides  or  edge  of  the  aperture. 


PSYCHOMETRIC  DATA: 


RELIABILITY 


VALIDITY 


rxx-.76  for  time  in  contact  score  (Melton,  1947) 

Mean  correlation  between  three  trials  with  the  preferred 
hand,  corrected  for  the  full  number  of  preferred  hand 
trials  (i.e.,  3  trials).  £1-310  unclassified 
candidates  for  elementary  pilot  training. 


Ixx».75  for  number  of  contacts  score  (Melton,  1947) 

One-week  test-retest  reliability.  N-328  unclassified 
candidates  for  elementary  pilot  training. 


Median  rxy«.06  for  total  number  of  contacts  score  with 
graduation-elimination  from  pilot  training  (Melton, 
1947) 

H-6,042  Army  Air  Forces  elementary  pilot  trainees  in 
seven  training  classes. 


Median  rxy-.06  for  total  number  of  contacts  score  with 
Instructor  ratings  on  six  pilot  training  performance 
criteria  (coordination,  appropriate  controls,  feel  of 
controls,  smoothness  of  movement,  progress  in  technique, 
probability  of  success  in  pilot  training)  (Melton,  1947) 

U-1,000  Army  Air  Forces  elementary  pilot  trainees. 


Median  rxy--.03  for  total  number  of  contacts  score  with 
graduation-elimination  from  navigator  training  (Melton, 
1947) 

N-448  Army  Air  Forces  navigator  trainees  In  three 
training  classes. 


Median  rxy-.06  for  total  number  of  contacts  score  with 
with  bombing  accuracy  score  during  bombardier  training 
(Melton,  1947) 

N-331  Army  Air  Forces  bombardier  trainees  in  three 
training  classes. 


A-4 


TEST  TITLE: 


Complex  Coordination  Test  (Melton,  1947) 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 


DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 


Ability  to  simultaneously  coordinate  the  movements  of 
the  hands  and  the  feet  (Multilimb  Coordination) 


Patterns  of  lights  are  presented  whose  positions  are  to 
be  matched  by  appropriate  adjustment  of  stick  and  rudder 
controls  (see  the  figure  below). 


A  correct  response  is  accomplished  only  when  both  the 
hands  and  the  feet  have  completed  and  maintained  the 
appropriate  adjustments,  at  which  point  a  new  pattern  of 
lights  to  be  matched  Is  presented.  Score  is  the  number 
of  completed  matchings  during  two  2-minute  test  periods 
separated  by  a  30-second  rest. 


A-5 


PSYCHOMETRIC  DATA: 


RELIABILITY 


VALIDITY 


rxx-.89  (Melton,  1947) 

Odd-even  split  half  reliability,  corrected  for  the  full 
length  of  the  test  (4  trials),  {i-7,627  unclassified 
candidates  for  pilot  training. 


Ixx-.82  (Fleishman,  1958) 

Odd-even  split  half  reliability,  corrected  for  the  full 
length  of  the  test  (2  trials).  8-204  basic  trainee 
airmen. 


rxx-.87,  .59,  and  .83  (Melton,  1947) 

Immediate,  one-week,  and  28-day  test-retest 
reliabilities.  8-415,  313,  and  692  unclassified 
candidates  for  pilot  training,  respectively. 


Median  £xy-.29  with  graduation-elimination  from 
elementary  pilot  training  (Melton,  1947) 

8-44,618  Army  Air  Forces  elementary  pilot  trainees  in  24 
training  classes. 


35  with  graduation-elimination  from  elementary 
training  (Craeger,  1957) 


8-2,010  Air  Force  elementary  pilot  trainees. 


Median  £xy-.23  with  graduation-elimination  from  single 
engine  jet  flying  training  (Leiman  and  Friedman,  1952) 

8-3,538  Air  Force  advanced  pilot  trainees  in  two 
training  classes. 


Median  £xy-.38  with  graduation-elimination  from  pilot 
training  t Payne  et  al.,  1952) 

8-1,345  Navy  pilot  trainees  in  two  training  classes. 


Ixy».31  and  .36  with  graduation-elimination  from  primary 
pilot  training  (Zaccaria  and  Cox,  1952) 

8-1,016  aviation  cadets  and  547  student  officers, 
respectively,  in  primary  pilot  training. 


A- 6 


VALIDITY 

(cont’d)  Median  rxv“*43  with  graduation-elimination  from  primary 
pilot  training  (Fleishman,  1954b) 

U-4,311  Air  For  a  pilot  trainees  in  two  training 
classes. 


EvV-.23  with  graduation-elimination  from  pilot  training 
(Mil ins  et  al.,  1968) 

H-120  foreign  military  pilot  trainees  receiving  training 
from  the  U.  S.  Air  Force. 


rxy-.13  with  average  check  ride  grade  (Melton,  1947) 
H-311  Army  Air  Forces  pilot  trainees. 


Median  rxy-.21  with  Instructor  ratings  on  six  pilot 
training  performance  criteria  (coordination,  appropriate 
controls,  feel  of  controls,  smoothness  of  movement, 
progress  In  technique,  probability  of  success  In  pilot 
training)  (Melton,  1947) 

H-1,000  Army  Air  Forces  elementary  pilot  trainees. 


.13  with  Instructor  ratings  of  flying  proficiency 
ton,  1947) 

t{«562  Army  Air  Forces  advanced  pilot  trainees  training 
on  single-engine  planes. 


rxy-.13  and  .07  with  instructor  ratings  of  flying 
proficiency  during  the  advanced  and  the  transition  stage 
of  flying  training,  respectively 

£1-685  Army  Air  Forces  advanced  pilot  trainees  training 
on  four-engine  planes. 


Median  rxy-.10  with  gunnery  proficiency  scores  obtained 
during  training  (Melton,  1947) 

£1-1 ,716  Army  Air  Forces  advanced  pilot  trainees  in  four 
training  classes. 


A-7 


VALIDITY 

(cont’d)  Median  rxv-.25  with  graduation-elimination  from 
helicopter  training  (Zeidner  et  al.,  1958) 

H-249  Army  helicopter  pilot  trainees  in  two  training 
classes. 


Median  exv».20  with  graduation-elimination  from 
navigator  training  (Melton,  1947) 

ti-1,752  Army  Air  Forces  navigator  trainees  in  two 
training  classes. 


Median  EvV"*26  with  graduation-elimination  from 
bombardier  training  (Melton,  1947) 

Ii-4,333  Army  Air  Forces  bombardier  trainees  In  nine 
training  classes. 


EvV-.24  with  percent  of  "on  target"  bombs  In  combat 
(Ail  ton,  1947) 

d-32  Army  Air  Forces  lead  bombardiers. 


rxy-.16  with  gunnery  accuracy  (Melton,  1947) 

H-164  Army  Air  Forces  trained  B-29  remote-control  turret 
gunners. 


Median  rxv«.18  with  composite  flight  and  ground  trainer 
grades  (Melton,  1947) 

tM50  Army  Air  Forces  radar  operator  trainees  in  two 
training  classes. 


Median  rxy».06  with  several  archival  measures  of  driving 
performance  (e.g.,  number  of  accidents,  number  of  moving 
violations)  (Farr  et  al.,  1971) 

ft-299  taxi  cab  drivers. 


A-8 


TEST  TITLE: 


Complex  Coordination  Test  (Sanders  et  al.,  1971) 


DESIGNED  TO 
(CONSTRUCT) 


DESCRIPTION 
OF  TASK: 


MEASURE 

The  speed  at  which  a  subject  makes  complex  hand  and  foot 
motor  adjustments  to  a  series  of  signal  patterns 
(Multlllmb  Coordination) 


This  Is  a  computer-administered  test.  The  subject's 
task  Is  to  use  joysticks  and  a  foot-controlled  rudder  to 
maintain  two  stimuli  In  a  null  position  (see  the  figure 
below).  The  movement  of  both  stimuli  Is  partially  under 
the  control  of  a  computer  program.  This  program  causes 
the  stimuli  to  make  frequent,  unpredictable  changes  In 
speed  and  direction.  The  first  stimulus,  which  Is  X- 
shaped,  can  also  be  controlled  via  a  joystick.  The 
subject  must  attempt  to  keep  this  stimulus  centered  at 
the  intersection  of  the  row  and  column  of  dots.  The 
direction  and  displacement  of  the  joystick  control  the 
direction  of  movement  and  velocity,  respectively,  of  the 
first  stimulus.  The  second  stimulus  is  a  short  vertical 
line  which  appears  near  the  bottom  of  the  display. 
Movement  of  this  stimulus  can  be  controlled  by  a  rudder 
bar.  The  subject  must  use  both  feet  to  manipulate  this 
rudder  bar.  The  direction  and  displacement  of  the 
rudder  bar  control  the  direction  and  velocity, 
respectively,  of  the  second  stimulus.  The  second 
stimulus  can  only  move  along  the  horizontal  dimension. 
The  subject’s  task  is  to  keep  the  second  stimulus 
aligned  along  the  vertical  row  of  dots. 


ADMINISTRATION 
AND  SCORING: 


PSYCHOMETRIC  DATA: 

RELIABILITY 

VALIDITY 


The  test  consists  of  five  1-mlnute  trials.  There  Is  no 
rest  period  between  trials;  the  test  Is  continuous.  The 
subject  receives  five  error  scores:  (1)  horizontal 
deviation  of  the  first  stimulus  from  the  target  point, 
called  the  X  Axis  score;  (2)  vertical  deviation  of  the 
first  stimulus  from  the  target  point,  called  the  Y  Axis 
score;  (3)  the  square  root  of  Ihe  sum  of  squares  of  the 
X  Axis  and  Y  Axis  error  scores  (i.e.,  the  Euclidean 
distance),  called  the  Generated  score;  (4)  the 
horizontal  deviation  of  the  second  stimulus  from  the 
target  point,  called  the  Z  Axis  score;  and  (5)  the 
number  of  times  the  subject  allows  the  second  stimulus 
to  move  off  the  screen,  called  the  Reset  score. 


rxx-.92  for  the  Generated  score  (Sanders  et  al.,  1971) 

Correlation  between  scores  from  minutes  4  and  5  of  the 
test,  corrected  for  length  to  represent  the  reliability 
of  the  sum  of  the  two  scores,  ft-120  Air  Force  officer 
trainees. 


Median  rXy**19,  .24,  .20,  .22,  and  .13  with  graduation- 
elimination  from  pilot  training  for  the  X  Axis  score,  Y 
Axis  score.  Generated  score,  Z  axis  score,  and  Reset 
score,  respectively  (McGrevey  and  Valentine,  1974) 


N-213  Air  Force  officer  undergraduate  pilot  trainee:  in 
two  training  classes. 


-.18  with  graduation-elimination  from  pilot  traini 
ry  and  Goodman,  In  preparation) 


ng 


M-294  Navy  undergraduate  pilot  trainees. 


A-10 


TEST  TITLE: 

DESIGNED  TO  MEASURE 
(CONSTRUCT) : 

DESCRIPTION 
OF  TASKS: 


ADMINISTRATION 
AND  SCORING: 

PSYCHOMETRIC  DATA: 

RELIABILITY 


VALIDITY 


General  Aptitude  Test  Battery  (GATB)  Finger  Dexterity 
Scale 


Finger  Dexterity 


Assemble  Test.  The  board  consists  of  50  holes  and  two 
trays  containing  rivets  and  washers,  respectively.  The 
subject  must  pick  up  a  rivet  with  the  preferred  hand  and 
a  washer  with  the  non-preferred  hand.  The  subject  must 
then  insert  the  rivet  through  one  of  the  holes  in  the 
board  and  secure  it  with  the  washer. 


Disassemble  Test.  The  bottom  test  board  consists  of  50 
rivets  secured  into  holes  with  washers,  and  the  top  test 
board  consists  of  50  holes.  The  subject  must  remove  the 
washer  from  the  rivet,  place  the  washer  on  a  ring, 
remove  the  rivet  from  the  hole,  and  place  the  rivet  in 
an  empty  hole  in  the  top  test  board. 


The  subject’s  score  Is  the  number  of  parts  assembled  and 
disassembled  In  one  and  one-half  minutes  and  In  one 
minute,  respectively. 


rxx-.65  and  .69  (U.  S.  Department  of  Labor,  1970) 

Three-month  test-retest  reliability  for  605  male  and  605 
female  high  school  seniors,  respectively. 


-68>  and  -74  (U.  S.  Department  of  Labor,  1970) 

One-year,  2-year,  and  3-year  test-retest  reliability  for 
a  sample  of  government  employees.  N  unknown. 


Median  nxy- . 12  across  educational,  training,  and  job 
proficiency  criteria  for  professional,  technical,  and 
managerial  jobs  (Droege,  1968;  U.  S.  Department  of 
Labor,  1970) 

N-105  validity  studies. 


A-ll 


VALIDITY 

(cont’d)  Median  £xy».13  across  educational,  training,  and  job 
proficiency  criteria  for  clerical  jobs  (Droege,  1968; 
U.  S.  Department  o  Labor,  1970) 

fJ-47  validity  studies. 


Median  £xy«.08  across  educational,  training,  and  job 
proficiency  criteria  for  sales  jobs  (Droege,  1968;  U.  S. 
Department  of  Labor,  1970) 

H-5  validity  studies. 


Median  £xy-.10  across  educational,  training,  and  job 
proficiency  criteria  for  protective  service  jobs 
(Droege,  1968;  U.  S.  Department  of  Labor,  1970) 

validity  studies. 


Median  £xy«.14  across  educational,  training,  and  job 
proficiency  criteria  for  service  jobs  (Droege,  1968; 
U.  S.  Department  of  Labor,  1970) 

K-47  validity  studies. 


Median  £xy-.20  across  educational,  training,  and  job 
proficiency  criteria  for  mechanical  and  structural 
maintenance  jobs  (Droege,  1968;  U.  S.  Department  of 
Labor,  1970) 

M-59  validity  studies. 


Median  £xy».16  across  educational,  training,  and  job 
proficiency  criteria  for  electronics  jobs  (Droege,  1968; 
U.  S.  Department  of  Labor,  1970) 

N-10  validity  studies. 


Median  £xy-.25  across  educational,  training,  and  job 
proficiency  criteria  for  industrial  jobs  (Droege,  1968; 
U.  S.  Department  of  Labor,  1970) 

H-257  validity  studies. 


A-12 


VALIDITY 

(cont’d)  Median  cxy-.18  across  educational,  training,  and  job 

proficiency  criteria  for  miscellaneous  jobs  (e.g.,  farm 
worker,  power  plant  operator)  (Droege,  1968;  U.  S. 
Department  of  Labor,  1970) 

M-27  validity  studies. 


A-13 


TEST  TITLE: 

DESIGNED  TO  MEASURE 
(CONSTRUCT) : 

DESCRIPTION 
OF  TASKS: 


ADMINISTRATION 
AND  SCORING: 

PSYCHOMETRIC  DATA: 
RELIABILITY 


VALIDITY 


General  Aptitude  Test  Battery  (GATB)  Manual  Dexterity 
Scale 


Manual  Dexterity 


Placing  Test.  The  test  apparatus  consists  of  two 
boards,  each  containing  48  holes  (four  rows  with  12 
holes  in  each  row).  The  holes  on  one  of  the  boards  are 
filled  with  pegs  which  fit  snugly  into  the  holes.  The 
subject’s  task  is  to  move  all  of  the  pegs  from  the  holes 
of  the  first  board  to  the  holes  of  the  second  board  as 
quickly  as  possible. 


Turning  Test.  This  test  requires  only  one  board.  The 
holes  of  the  board  are  all  filled  with  pegs  before  the 
test  begins.  The  subject’s  task  is  to  remove  pegs  from 
the  holes  with  one  hand,  turn  the  pegs  over  with  the 
second  hand,  and  re-insert  the  pegs  into  the  holes  with 
the  second  hand  as  quickly  as  possible. 


The  subject  is  given  three  15-second  placing  trials  and 
three  30-second  turning  trials.  Trials  are  separated  by 
15-second  rest  periods.  The  subject’s  score  is  always 
the  number  of  pegs  removed  from  their  holes,  regardless 
of  whether  the  pegs  are  placed  or  turned  properly. 


rxx-.73  and  .72  (U.  S.  Department  of  Labor,  1970) 

Three-month  test-retest  reliability  for  605  male  and  605 
female  high  school  seniors,  respectively. 


£xx-.76,  .72,  and  .78  (U.  S.  Department  of  Labor,  1970) 

One-year,  2-year,  and  3-year  test-retest  reliability  for 
a  sample  of  government  employees.  Ji  unknown. 


Median  rxy».12  across  educational,  training,  and  job 
proficiency  criteria  for  professional,  technical,  and 
managerial  jobs  (Droege,  1968;  U.  S.  Department  of 
Labor,  1970) 

M-105  validity  studies. 


A-15 


VALIDITY 

(cont’d)  Median  £xy-.13  across  educational,  training,  and  job 
proficiency  criteria  for  clerical  jobs  (Droege,  1968; 
U.  S.  Department  of  Labor,  1970) 

H-47  validity  studies. 


Median  rxy«.08  across  educational,  training,  and  job 
proficiency  criteria  for  sales  jobs  (Droege,  1968;  U.  S. 
Department  of  Labor,  1970) 

fi-5  validity  studies. 


Median  £xy-.15  across  educational,  training,  and  job 
proficiency  criteria  for  protective  service  jobs 
(Droege,  1968;  U.  S.  Department  of  Labor,  1970) 

validity  studies. 


Median  rxy«.18  across  educational,  training,  and  job 
proficiency  criteria  for  service  jobs  (Droege,  1968; 
U.  S.  Department  of  Labor,  1970) 

H-47  validity  studies. 


Median  rxy-.20  across  educational,  training,  and  job 
proficiency  criteria  for  mechanical  and  structural 
maintenance  jobs  (Droege,  1968;  U.  S.  Department  of 
Labor,  1970) 

M»60  validity  studies. 


Median  £xy-.10  across  educational,  training,  and  job 
proficiency  criteria  for  electronics  jobs  (Droege,  1968; 
U.  S.  Department  of  Labor,  1970) 

fi-10  validity  studies. 


Median  £xy«.25  across  educational,  training,  and  job 
proficiency  criteria  for  industrial  jobs  (Droege,  1968; 
U.  S.  Department  of  Labor,  1970) 

N-257  validity  studies. 


A-16 


VALIDITY 

(cont’d)  Median  £xy-.10  across  educational,  training,  and  job 

proficiency  criteria  for  miscellaneous  jobs  (e.g.,  farm 
worker,  power  plant  operator)  (Droege,  1968;  U.  S. 
Department  of  Labor,  1970) 

H*28  validity  studies. 


A-17 


TEST  TITLE: 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 


DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 


PSYCHOMETRIC  DATA: 


RELIABILITY 


VALIDITY 


General  Aptitude  Test  Battery  (GATB)  Motor  Coordination 
Scale 


Ability  to  coordinate  eyes  and  hands  or  fingers  rapidly 
and  accurately  in  making  precise  movements  with  speed 
(Wrist-Finger  Speed) 


This  test  consists  of  a  series  of  squares  in  which  the 
subject  is  to  make  three  pencil  marks,  working  as 
rapidly  as  possible.  The  marks  to  be  made  are  short 
lines,  two  vertical  and  the  third  a  horizontal  line 
beneath  them. 


The  subject’s  score  is  the  number  of  squares  which  are 
marked  in  60  seconds. 


£xx-.76  and  .86  (U.  S.  Department  of  Labor,  1970) 

Three-month  test-retest  reliability  for  605  male  and  605 
female  high  school  seniors,  respectively. 


Exx«.86,  .85,  and  .88  (U.  S.  Department  of  Labor,  1970) 

One-year,  2-year,  and  3-year  test-retest  reliability  for 
a  sample  of  government  employees.  H  unknown. 


Median  jcxy-.12  across  educational,  training,  and  job 
proficiency  criteria  for  professional,  technical,  and 
managerial  jobs  (Droege,  1968;  U.  S.  Department  of 
Labor,  1970) 

ti-97  validity  studies. 


Median  j;xy«.14  across  educational,  training,  and  job 
proficiency  criteria  for  clerical  jobs  (Droege,  1968; 
U.  S.  Department  of  Labor,  1970) 

H-49  validity  studies. 


A-19 


VALIDITY 

(cont’d)  Median  £xy-.14  across  educational,  training,  and  job 

proficiency  criteria  for  sales  jobs  (Droege,  1968;  U.  S. 
Department  of  Labor,  1970) 

ti«5  validity  studies. 


Median  rxy-.23  across  educational,  training,  and  job 
proficiency  criteria  for  protective  service  jobs 
(Droege,  1968;  U.  S.  Department  of  Labor,  1970) 

{{-5  validity  studies. 


Median  rxy*.20  across  educational,  training,  and  job 
proficiency  criteria  for  service  jobs  (Droege,  1968; 
U.  S.  Department  of  Labor,  1970) 

{{-47  validity  studies. 


Median  rxy-.16  across  educational,  training,  and  job 
proficiency  criteria  for  mechanical  and  structural 
maintenance  jobs  (Droege,  1968;  U.  S.  Department  of 
Labor,  1970) 

{{-59  validity  studies. 


Median  rxy-.14  across  educational,  training,  and  job 
proficiency  criteria  for  electronics  jobs  (Droege,  1968; 
U.  S.  Department  of  Labor,  1970) 

{{-10  validity  studies. 


Median  ixy-.21  across  educational,  training,  and  job 
proficiency  criteria  for  industrial  jobs  (Droege,  1968; 
U.  S.  Department  of  Labor,  1970) 

H-256  validity  studies. 


Median  rxy-.22  across  educational,  training,  and  job 
proficiency  criteria  for  miscellaneous  jobs  (e.g.,  farm 
worker,  power  plant  operator)  (Droege,  1968;  U.  S. 
Department  of  Labor,  1970) 

{{-28  validity  studies. 


A- 20 


TEST  TITLE: 


Large  Tapping  Test 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 

DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 

PSYCHOMETRIC  DATA: 
RELIABILITY 
VALIDITY 


Wrist-Finger  Speed 


The  subject  is  presented  with  a  piece  of  paper 
containing  six  blocks.  Each  block  in  turn  contains  four 
rows  of  10  large  circles.  The  subject’s  task  is  to 
place  three  pencil  marks  or  dots  in  each  circle  as 
rapidly  as  possible.  (The  figure  below  Is  similar,  but 
not  identical,  to  a  portion  of  a  block  from  this  test.) 

D  00  | 

)000 
oooo 
)0000 


The  subject’s  score  Is  the  number  of  circles  marked  with 
three  dots  at  the  end  of  two  minutes. 


No  data  are  available. 


£xy-.05  with  graduation-elimination  from  pilot  training 
(Hunter  and  Thompson,  1978) 

ft-245  Air  Force  pilot  trainees. 


£yv».09  with  graduation-elimination  from  pilot  training 
(Croll ,  1973) 

H-120  South  Vietnamese  Air  Force  pilot  trainees 
receiving  training  from  the  U.  S.  Air  Force. 


A-21 


TEST  TITLE: 


Minnesota  Rate  of  Manipulation  Test 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 


DESCRIPTION 
OF  TASKS: 


ADMINISTRATION 
AND  SCORING: 


PSYCHOMETRIC  DATA: 


RELIABILITY 


Ability  to  make  gross  arm-hand  manipulations  (Manual 
Dexterity) 


Placing  Test.  The  test  apparatus  consists  of  two 
boards,  each  containing  60  wells  (four  rows  with  15 
wells  in  each  row;  see  the  figure  below).  The  wells  on 
one  of  the  boards  are  filled  with  blocks  which  fit 
snugly  into  the  wells.  The  subject’s  task  is  to  move 
all  of  the  blocks  from  the  wells  of  the  first  board  to 
the  wells  of  the  second  board  as  quickly  as  possible. 


Turning  Test.  This  test  requires  only  one  board.  At 
the  beginning  of  the  test,  the  wells  of  the  board  are 
filled  with  blocks.  The  subject’s  task  is  to  remove  the 
blocks  from  the  wells  with  one  hand,  turn  the  blocks 
over  with  the  second  hand,  and  re-insert  the  blocks  into 
the  wells  as  quickly  as  possible  with  the  second  hand. 


The  subject  is  given  five  trials  for  each  test.  The 
first  trial  is  a  practice  trial.  The  total  time 
required  for  the  subject  to  place  or  turn  all  60  blocks 
On  the  remaining  four  trials  is  the  subject’s  score  on 
the  test. 


rxx-.87  for  placing  and  .79  for  turning  (Parker  and 
Fleishman,  1960) 

The  method  of  estimation  is  unknown.  Since  the  test 
consists  of  two  placing  trials  and  two  turning  trials, 
and  since  the  reliability  estimates  appear  to  be  based 
on  a  single  test  administration,  it  is  likely  that  these 
reliabilities  represent  the  correlation  between  the  two 
trials  corrected  for  total  test  length.  N-203  freshman 
and  sophomore  U.  S.  Air  Force  ROTC  students. 


A-23 


VALIDITY 


Partial  j\,v«.30  with  grades  in  a  shop  mechanics  course 
(Rim,  19625 

N-224  shop  students  in  an  Israeli  high  school. 

Predictor  was  the  sum  of  placing,  turning,  and 
displacing  test  scores.  The  variance  attributable  to  a 
paper-and-pencil  intelligence  test  was  partialled  from 
both  the  predictor  and  the  criterion. 


rXy-.24  with  supervisory  ratings  of  quickness  in 
wrapping  soap  (Shanthamani ,  1978) 

H-60  soap  packers  in  a  factory  in  India.  Predictor  was 
the  sum  of  the  placing  and  turning  scores. 


52  with  individual  production  records  (Shanthamani, 


ii-15  soap  packers  in  a  factory  in  India.  Predictor  was 
the  sum  of  the  placing  and  turning  scores. 


A-24 


TEST  TITLE: 


Motor  Judgment  Test 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 


DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 


Rate  Control 


The  subject  Is  confronted  by  two  adjacent  disks  rotating 
at  a  constant  speed.  Each  disk  has  black  and  white 
sections  on  its  perimeter  (see  the  figure  below). 

Between  these  disks  is  a  pointer.  The  subject  can 
control  the  speed  of  movement  of  this  pointer  with  the 
control  stick.  A  forward  movement  of  the  stick  slows 
the  pointer  and  a  backward  movement  of  the  stick  speeds 
the  pointer.  The  subject  cannot  totally  halt  the 
rotation  of  the  pointer  and  he  cannot  exert  any  control 
over  the  rotation  of  the  two  disks.  The  subject  must 
manipulate  the  control  stick  so  as  to  make  as  many 
rotations  as  possible  without  crossing  the  black  areas 
on  the  rotating  disks. 


The  subject’s  score  Is  the  ratio  of  the  number  of 
pointer  revolutions  to  the  number  of  errors  (crossings 
of  the  black  areas  on  the  rotating  disks)  during  four  1- 
minute  trials.  The  trials  are  separated  by  15-second 
rest  periods. 


A- 2  5 


PSYCHOMETRIC  DATA: 


RELIABILITY 


VALIDITY 


rxx-.96  and  .92  (Melton,  1947) 

Odd-even  split  half  reliability  for  the  errors  score  and 
revolutions  score,  respectively,  corrected  for  the  full 
length  of  the  test  (3  trials).  H-50  qualified  aircrew 
candidates. 


rxx«.76  (Fleishman,  1958) 

Odd-even  split  half  reliability  for  the  ratio  of  errors 
to  revolutions,  corrected  for  the  full  length  of  the 
test  (4  trials).  N-204  basic  trainee  airmen. 


Median  £Xy»-.02  with  several  archival  measures  of 
driving  performance  (e.g.,  number  of  accidents,  number 
of  moving  violations)  (Farr  et  al.,  1971) 

d-301  taxi  cab  drivers. 


A-26 


TEST  TITLE: 


O’Connor  Tweezer  and  Finger  Dexterity  Test 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 

DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
ANO  SCORING: 

PSYCHOMETRIC  DATA: 
RELIABILITY 

VALIDITY 


Finger  Dexterity 


The  test  board  consists  of  one  hundred  3/16- inch  holes 
(see  the  figure  below).  The  subject’s  task  is  to  place 
three  1/16-inch  solid  brass  pins  in  each  hole  as  quickly 
as  possible. 


The  subject  is  given  three  minutes  to  place  the  pins  in 
the  holes.  The  test  score  is  the  number  of  holes  filled 
with  three  pins  at  the  end  of  the  three  minutes. 


Ixx».76  (Parker  and  Fleishman,  1960) 

Method  of  estimation  is  unknown.  [1-203  freshman  and 
Sophomore  U.  S.  Air  Force  ROTC  students. 


Median  rxy«.18  with  supervisory  ratings  of  overall  job 
performance  (Laney,  1951) 

M-60  gas  appliance  service  workers  from  two  gas  utility 
companies. 


Median  rxy--04  with  piece-rate  pay  (Inskeep,  1971) 

N-1,092  female  sewing  machine  operators  in  eight 
different  plants. 


A-27 


VALIDITY 

(cont’d)  Partial  ryV*-33  with  grades  in  a  shop  mechanics  course 
(Rim,  1962; 

H-224  shop  students  in  an  Israeli  high  school.  The 
variance  attributable  to  a  paper-and-pencil  intelligence 
test  was  partialled  from  both  the  predictor  and  the 
criterion. 


A-28 


TEST  TITLE: 


Purdue  Pegboard 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 


DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 


Finger  Dexterity 


The  test  apparatus  consists  of  a  board  with  50  small 
holes  (see  the  figure  below).  At  the  end  of  the  board 
is  a  tray  containing  small  dishes  of  pegs,  collars,  and 
washers. 


Right  hand  score.  The  subject  must  pick  up  one  peg  at  a 
time  from  the  dish  with  his  right  hand  and  Insert  the 
peg  Into  one  of  the  holes  in  the  board.  The  subject’s 
score  is  the  number  of  pegs  inserted  in  one  30-second 
trial . 

Left  hand  score.  This  is  the  same  as  the  right  hand 
score,  except  the  subject  must  use  his  left  hand  to 
manipulate  the  pegs. 

Both  hands  score.  The  subject  must  pick  up  two  pegs  at 
a  time  from  the  dish,  one  with  his  right  hand  and  one 
with  his  left  hand,  and  insert  them  into  holes  in  the 
board.  The  score  is  the  number  of  pegs  inserted  In  one 
30-second  trial . 

Assembly  score.  The  subject  must  assemble  peg-washer- 
collar  combinations  as  quickly  as  possible.  The 
subject’s  score  is  the  number  of  combinations  assembled 
in  one  30-second  trial. 

Summation  score.  This  is  the  sum  of  the  four  scores 
above. 


See  above. 


A-29 


PSYCHOMETRIC  DATA: 


RELIABILITY 


VALIDITY 


90  for  the  Summation  Score  (Parker  and  Fleishman, 


The  authors  state  that  this  reliability  is  "based  on 
(the)  average  reliability  of  the  four  subparts  of  this 
test  and  corrected  for  an  increase  in  length  by  a  factor 
of  four"  (Parker  and  Fleishman,  1960,  p.3).  The  method 
of  estimating  the  reliability  of  the  four  subparts  is 
unknown.  N-203  freshman  and  sophomore  U.  S.  Air  Force 
ROTC  students. 


Partial  £yv-.70  with  grades  in  a  shop  mechanics  course 
(Rim,  196*5 

N-224  shop  students  In  an  Israeli  high  school.  The 
variance  attributable  to  a  paper-and-pencil  intelligence 
test  was  partialled  from  both  the  predictor  and  the 
criterion. 


A- 30 


TEST  TITLE: 


Rate  Control 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 


DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 


PSYCHOMETRIC  DATA: 


RELIABILITY 


Rate  Control 


The  test  apparatus  consists  of  a  box  containing  a  curved 
scale  (see  the  figure  below).  A  vertical  target  line 
moves  back  and  forth  across  this  scale.  The  line  makes 
frequent,  unpredictable  changes  in  speed  and  direction. 
Centered  below  the  scale  Is  a  rotary  knob.  This  knob 
controls  movement  of  a  thin  pointer  line  along  the 
scale.  The  subject  must  attempt  to  keep  this  pointer 
line  In  coincidence  with  the  target  line  as  the  target 
line  moves  about. 


The  subject’s  score  on  this  test  is  the  total  time  the 
target  and  pointer  lines  are  in  coincidence  across  eight 
one-minute  trials. 


rxx«.81  (Melton,  1947) 

Odd-even  split-half  reliability,  corrected  for  the  full 
length  of  the  test  (8  trials).  N-381  airborne  radar 
students. 


£xx-.69  (Fleishman,  1958) 

Odd-even  split-half  reliability,  corrected  for  the  full' 
length  of  the  test  (4  trials),  fi-204  basic  trainee 
airmen. 


A-31 


VALIDITY  £yV*.02  with  graduation-elimination  from  radar  operator 
training  (Melton,  1947) 

{{■381  Army  Air  Forces  radar  operator  trainees. 


A-32 


TEST  TITLE: 


Rotary  Pursuit  Test 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 


DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 


PSYCHOMETRIC  DATA: 


RELIABILITY 


Control  Precision 


The  subject  must  attempt  to  keep  a  stylus  tip  in  contact 
with  a  small  metallic  target  while  the  target  is 
revolving  near  the  edge  of  a  phonograph-like  turntable 
(see  the  figure  below). 


The  subject’s  score  is  the  total  time  on  target  over 
five  20-second  trials.  The  trials  are  separated  by  10- 
second  rest  periods. 


rxx-.98  (Melton,  1947) 

Odd-even  split-half  reliability,  corrected  for  the  full 
length  of  the  test  (20  trials).  H-301  unclassified 
candidates  for  pilot  training. 


£xx».81  (Fleishman,  1958) 

Odd-even  split-half  reliability,  corrected  for  the  full 
length  of  the  test  (5  trials).  H-204  basic  trainee 
airmen. 


£xx-.88  (Melton,  1947) 

Immediate  test-retest  reliability.  N=398  unclassified 
candidates  for  pilot  training. 
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VALIDITY 


Eyv».03  with  graduation-elimination  from  pilot  training 
(Mullins  et  al.,  1968) 

N-120  foreign  military  pilot  trainees  receiving  training 
?rom  the  U.  S.  Air  Force. 


Median  £xy-.22  with  graduation-elimination 
elementary  pilot  training  (Melton,  1947) 


from 


M-12,884  Army  Air  Forces  elementary  pilot  trainees  in  10 
training  classes. 


Median  £xv-.27  with  graduation-elimination  from  primary 
pilot  training  (Fleishman,  1954b) 

N-4,311  Air  Force  pilot  trainees  in  two  training 
classes. 


.01  with  Instructor  ratings  of  flying  proficiency 
ton,  1947) 

M-562  Army  Air  Forces  advanced  pilot  trainees. 


Median  rxy-.02  gunnery  proficiency  scores 
obtained  During  training  (Melton,  1947) 

H*1.193  Army  Air  Forces  advanced  pilot  trainees  in  three 
training  classes. 


.22  with  graduation-elimination  from 
pilot  training  (Zeidner  et  al.,  1958) 

H-249  Army  helicopter  pilot  trainees  in  two  training 
classes. 


Median  rxy-.04  with  graduation-elimination  from 
navigator  training  (Melton,  1947) 

N-l , 750  Army  Air  Forces  navigator  trainees  in  two 
training  classes. 


Median  rxy-.14  with  graduation-elimination  from 
bombardier  training  (Melton,  1947) 

ti-3,150  Army  Air  Forces  bombardier  trainees  in  six 
training  classes. 
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VALIDITY 

(cont’d)  £yv«.02  with  composite  flight  and  ground  trainer  grades 
(Melton,  1947) 

fi-47  Army  Air  Forces  radar  operator  trainees. 
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TEST  TITLE: 


Rudder  Control  Test 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 


DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 


Ability  to  simultaneously  coordinate  the  movement  of  two 
feet  (Multilimb  Coordination) 


The  subject  sits  in  a  mock  airplane  cockpit,  which  he 
attempts  to  keep  lined  up  steadily  with  one  of  three 
target  lights  as  they  come  on  in  front  of  him.  His  own 
weight  throws  the  seat  off  balance  unless  he  applies  and 
maintains  proper  correction  by  means  of  foot  pedals. 

The  subject  must  also  use  the  appropriate  pedal  control 
to  shift  the  cockpit  from  one  light  to  another  as  these 
come  on  at  random  intervals  (see  the  figure  below). 


The  subject’s  score  is  the  total  time  the  cockpit  is 
lined  up  with  the  proper  light  during  the  three  112- 
second  trials.  The  trials  are  separated  by  30-second 
rest  periods. 
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PSYCHOMETRIC  DATA: 


RELIABILITY 


VALIDITY 


rxx-.93  (Melton,  1947) 

Odd-even  split-half  reliability,  corrected  for  the  full 
length  of  the  test  (6  trials).  {{-1,000  unclassified 
candidates  for  pilot  training. 


Ixx-.82  (Fleishman,  1958) 

Odd-even  split-half  reliability,  corrected  for  the  full 
length  of  the  test  (3  trials).  {{"204  basic  trainee 
airmen. 


rxx-.67  and  .76  (Melton,  1947) 

Twenty-eight-day  test-retest  reliability  for  311  and  312 
unclassified  candidates  for  pilot  training, 
respectively. 


Median  j\,v-.26  with  graduation-elimination  from  pilot 
training  \Melton,  1947) 

{{-12,178  Army  Air  Forces  elementary  pilot  trainees  in  12 
training  classes. 


rxy".58  with  graduation-elimination  from  elementary 
pilot  training  (Craeger,  1950) 


{{-2,010  Air  Force  elementary  pilot  trainees. 


Median  rxv-.39  with  graduation-elimination  from  primary 
pilot  training  (Fleishman,  1954b) 

{{-4,311  Air  Force  pilot  trainees  in  two  training 
classes. 


Median  ixy-.32  with  graduation-elimination  from  single 
engine  jet  flying  training  (Leiman  and  Friedman,  1952) 

{{•3 ,358  Air  Force  advanced  pilot  trainees  in  two 
training  classes. 


Median  rXv"’46  with  graduation-elimination  from  pilot 
training  "(Payne  et  al.,  1952) 

{{-1 ,349  Navy  pilot  trainees  in  two  training  classes. 
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VALIDITY 

(cont’d)  £XY-.36  an<*  *25  with  graduation-elimination  from 
primary  pilot  training  (Zaccaria  and  Cox,  1952) 

H-1,016  aviation  cadets  and  547  student  officers, 
respectively,  in  primary  pilot  training. 


Median  £xv-.36  with  graduation-elimination  from  primary 
pilot  training  (Fleishman,  1953) 

ti-733  Air  Force  primary  pilot  trainees  in  two  training 
classes. 


Median  ]\,v-.42  with  graduation-elimination  from  pilot 
training  {Mullins  et  al.,  1968) 

H-215  foreign  military  pilot  trainees  in  two  training 
classes  receiving  training  from  the  U.  S.  Air  Force. 


£Xy* . 2 1  with  average  check  ride  grade  (Melton,  1947) 
M-311  Army  Air  Forces  pilot  trainees. 


Median  rXy*-24  with  graduation-elimination  from 
helicopter  pilot  training  (Zeidner  et  al.,  1958) 

H-249  Army  helicopter  pilot  trainees  in  two  training 
classes. 


rxy-.07  with  gunner  accuracy  (Melton,  1947) 

N-164  Army  Air  Forces  trained  B-29  remote-control  turret 
gunners. 
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TEST  TITLE: 


Santa  Ana  Finger  Dexterity  Test 


DESIGNED  TO  MEASURE 

(CONSTRUCT):  The  ability  to  move  and  turn  pegs  precisely  (Finqer 

Dexterity) 


DESCRIPTION 

0F  TASK;  test  apparatus  consists  of  48  pegs  inserted  into 

holes  in  a  test  board  (see  the  figure  below).  The  holes 
in  the  board  are  square.  The  pegs  themselves  have 
square  bottoms  and  round  tops.  Half  of  each  peg  top  is 
painted  blue  and  half  is  painted  yellow.  At  the  start 
of  the  test,  all  of  the  pegs  are  turned  so  that  the  same 
color  of  each  peg  top  is  nearest  the  subject.  The 
subject’s  task  is  to  pick  up  each  peg,  turn  it  180 
degrees  so  that  the  opposite  color  of  the  peg  top  is 
nearest  the  subject,  and  reinsert  the  peg  into  the  hole. 


ADMINISTRATION 

AND  SCORING:  The  test  includes  five  35-second  trials.  Each  pair  of 

trials  is  separated  by  a  30-second  rest  period,  during 
which  the  subject  must  realign  his  pegs  so  that  the  same 
color  of  all  peg  tops  is  nearest  the  subject.  The 
subject  is  required  to  manipulate  the  pegs  with  his 
right  hand  during  all  five  trials.  The  subject’s  score 
is  the  number  of  pegs  turned  and  reinserted  into  the 
board  during  the  five  trials. 
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PSYCHOMETRIC  DATA: 


RELIABILITY 


VALIDITY 


rxx«.93  (Melton,  1947) 

Odd-even  split-half  reliability,  corrected  for  the  full 
length  of  the  test  (5  trials).  £M,000  unclassified 
candidates  for  pilot  training. 


Ixx-.85,  .74,  and  .74  (Melton,  1947) 

Immediate,  1-week,  and  28-day  test-retest  reliability, 
respectively.  N-403,  314,  and  701  unclassified 
candidates  for  pilot  training,  respectively. 


Median  rxy-.07  with  graduation-elimination  from 
elementary  pilot  training  (Melton,  1947) 

£1-26,032  Army  Air  Forces  elementary  pilot  trainees  in  17 
training  classes. 


rxy-.ll  with  graduation-elimination  from  elementary 
pilot  training  (Craeger,  1957) 

{1-2,010  Air  Force  elementary  pilot  trainees. 


Median  rxy«.09  with  graduation-elimination  from  single 
engine  jet  flying  training  (Leiman  and  Friedman,  1952) 

H-3,538  Air  Force  advanced  pilot  trainees  in  two 
training  classes. 


Median  rxv-.08  with  graduation-elimination  from  pilot 
training  \Payne  et  al.,  1952) 

£1-1,368  Navy  pilot  trainees  in  two  training  classes. 


rxy-.ll  and  .06  with  graduation-elimination  from  primary 
pilot  training  (Zaccaria  and  Cox,  1952) 

tl*  1,016  aviation  cadets  and  547  student  officers, 
respectively,  In  primary  pilot  training. 

£Xy- . 06  with  average  check  ride  grade  (Melton,  1947) 
£1-311  Army  Air  Forces  pilot  trainees. 
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VALIDITY 

(cont’d)  Median  rxy-.14  with  instructor  ratings  on  six  pilot 

training  performance  criteria  (coordination,  appropriate 
controls,  feel  of  controls,  smoothness  of  movement, 
progress  in  technique,  probability  of  success  in  pilot 
training)  (Melton,  1947) 

£1-1,000  Army  Air  Forces  elementary  pilot  trainees. 


rxv«-.04  with  instructor  ratings  of  flying  proficiency 
(Melton,  1947) 

ii-562  Army  Am-  Forces  advanced  pilot  trainees. 


Median  rxy-.08  with  gunnery  proficiency  scores  obtained 
during  training  (Melton,  1947) 

H-1,716  Army  Air  Forces  advanced  pilot  trainees  in  four 
training  classes. 


Median  rXy“-14  with  graduation-elimination  from 
navigator  training  (Melton,  1947) 

fi-2,481  Army  Air  Forces  navigator  trainees  in  three 
training  classes. 


Median  rxy-.13  with  graduation-elimination  from 
bombardier  training  (Melton,  1947) 

£(-4,454  Army  Air  Forces  bombardier  trainees  in  nine 
training  classes. 


rxv--.32  with  percent  of  "on  target"  bombs  in  combat 
(Melton,  1947) 

N-32  Army  Air  Forces  lead  bombardiers. 


rxy-.22  with  gunnery  accuracy  (Melton,  1947) 

H-164  Army  Air  Forces  trained  B-29  remote-control  turret 
gunners. 

£xv».19  with  composite  flight  and  ground  trainer  grades 
(Melton,  1947) 

N-69  Army  Air  Forces  radar  operator  trainees. 
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TEST  TITLE: 


Single-Dimension  Pursuitmeter 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 


DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 


Rate  Control 


The  subject  must  make  compensatory  adjustments  (in  and 
out  movements)  of  a  control  wheel  in  order  to  keep  a 
horizontal  line  in  a  null  position  as  it  deviates  from 
center  in  an  irregular  fashion  (see  the  figure  below). 
The  control  wheel  has  been  dampened  pneumatically  to 
Introduce  a  lag  into  the  system. 


The  subject  receives  two  scores  on  this  test.  The  timer 
score  is  the  total  time  the  pointer  is  held  in  a  null 
position.  The  work-adder  score  represents  the  total 
amount  of  movement  of  the  wheel  during  the  attempt  to 
keep  the  bar  centered.  The  test  consists  of  four  1- 
minute  trials.  The  trials  are  separated  by  15-second 
rest  periods. 
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PSYCHOMETRIC  DATA: 


RELIABILITY 


VALIDITY 


£yx-.88  and  .92  for  the  timer  and  work-adder  scores, 
respectively  (Melton,  1947) 

Odd-even  split-half  reliabilities,  corrected  for  the 
full  length  of  the  test  (8  trials).  H-1,483 
unclassified  candidates  for  pilot  training. 


rxx«.76  for  the  timer  score  (Fleishman,  1958) 

Odd-even  split-half  reliability,  corrected  for  the  full 
length  of  the  test  (4  trials).  M*204  basic  trainee 
airmen. 


Median  rxy-.13  and  .06  for  the  timer  and  work-adder 
scores,  respectively,  with  graduation-elimination  from 
pilot  training  (Melton,  1947) 

U-836  Army  Air  Forces  elementary  pilot  trainees  in  two 
training  classes. 
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TEST  TITLE: 


Small  Tapping  Test 


DESIGNEO  TO  MEASURE 
(CONSTRUCT) : 

DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 

PSYCHOMETRIC  DATA: 
RELIABILITY 
VALIDITY 


Aiming 


The  subject  is  presented  with  a  piece  of  paper 
containing  four  rows  of  10  small  circles  similar,  but 
not  identical,  to  the  figure  shown  below.  The  subject’s 
task  is  to  place  one  pencil  mark  or  dot  inside  each 
circle  as  rapidly  as  possible. 


O  0  o  o  o  o 
DOOOOOO 
0  0  0  o  o  o  o 
3000000  O 


The  subject’s  score  is  the  number  of  circles  containing 
a  pencil  mark  at  the  end  of  one  minute. 


No  data  are  available. 


-xv“.07  with  graduation-elimination  from  pilot  training 
(troll ,  1973) 

H-244  South  Vietnamese  Air  Force  pilot  trainees 
receiving  training  from  the  U.  S.  Air  Force. 
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TEST  TITLE: 


Steadiness  Aiming  Test 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 


DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 


Hand  steadiness  during  the  performance  of  a  simple 
aiming  task  (Arm-Hand  Steadiness) 


The  test  apparatus  consists  of  a  stylus  resting  in  a 
pivoted  holder  (see  the  figure  below).  The  handle  of 
the  stylus  extends  down  from  the  holder  at  a  steep 
angle.  The  tip  of  the  stylus  is  inserted  inside  a 
narrow  hole  or  aperture.  The  subject’s  task  is  to  hold 
and  balance  the  stylus  by  its  handle  in  such  a  way  that 
the  tip  of  the  stylus  does  not  touch  the  sides  or  edge 
of  the  aperture.  The  subject  must  also  take  care  not  to 
push  or  pull  on  the  stylus,  since  the  device  has  been 
constructed  so  that  these  movements  will  also  cause  the 
stylus  to  touch  the  sides  of  the  aperture. 


The  test  consists  of  six  40-second  trials  separated  by 
15-second  rest  periods.  The  subject  receives  two  scores 
on  the  test.  The  first  score  is  the  total  number  of 
contacts  between  the  stylus  and  the  sides  or  edge  of  the 
aperture  during  testing,  regardless  of  the  duration  of 
each  contact.  The  second  score  is  the  total  time  the 
stylus  spends  in  contact  with  the  sides  or  edge  of  the 
aperture  during  testing. 
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PSYCHOMETRIC  DATA: 


RELIABILITY 


VALIDITY 


rxx-.96  (Melton,  1947) 

Average  Inter-trial  correlation  between  scores  on  the 
six  trials  of  the  test,  corrected  for  the  full  length  of 
the  test.  ti*461  aviation  students. 


rxx«.84  (Parker  and  Fleishman,  1960) 

Method  of  estimation  is  unknown.  H-203  freshman  and 
sophomore  U.  S.  Air  Force  ROTC  students. 


.13  for  total  time  the  stylus  was  in  contact  with 
sides  or  edge  of  the  aperture  with  graduation- 
elimination  from  pilot  training  (Melton,  1947) 


H-516  Army  Air  Forces  elementary  pilot  trainees. 


TEST  TITLE: 


Trace  Tapping  II 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 

DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 

PSYCHOMETRIC  DATA: 
RELIABILITY 
VALIDITY 


Aiming 


The  subject  is  presented  with  a  piece  of  paper 
containing  an  irregularly  shaped  line.  One  hundred 
small  circles  are  placed  at  Irregular  intervals  along 
this  line  (see  the  figure  below).  The  subject’s  task  is 
to  follow  the  line,  placing  one  pencil  mark  or  dot 
inside  each  circle. 


The  subject’s  score  Is  the  number  of  circles  containing 
pencil  marks  at  the  end  of  30  seconds. 


No  data  are  available. 


ryv-.29  with  graduation-elimination  from  pilot  training 
(Croll,  1973) 

H-244  South  Vietnamese  Air  Force  pilot  trainees 
receiving  training  from  the  U.  S.  Air  Force. 


Ixy«.05  with  graduation-elimination  from  pilot  training 
(Hunter  and  Thompson,  1978) 

N=245  Air  Force  pilot  trainees. 
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TEST  TITLE: 


Two-Hand  Coordination  Test  (Melton,  1947) 


DESIGNED  TO  MEASURE 

(CONSTRUCT):  Ability  to  coordinate  the  use  of  both  hands  to  control  a 

pursuit  tracking  device  in  response  to  the  movements  of 
a  target  which  Is  pursuing  an  irregular  pathway  and 
proceeding  at  continuously  changing  rates  of  speed 
(Multilimb  Coordination) 


DESCRIPTION 

OF  TASK:  The  target  in  this  task  is  a  brass  button  which  is 

mounted  on  a  phonograph-like  turntable  (see  the  figure 
below).  The  button  rotates  in  a  clockwise  direction 
along  an  irregular  path  at  varying  rates  of  speed.  The 
subject’s  task  Is  to  keep  a  metal  leaf  in  continuous 
contact  with  this  brass  button.  The  position  of  the 
leaf  is  controlled  via  two  rotating  handles.  Rotation 
of  one  handle  controls  forward  and  backward  movement  of 
the  leaf,  while  rotation  of  the  other  handle  controls 
side  to  side  movement  of  the  leaf.  The  two  handles  can 
be  manipulated  simultaneously,  so  that  the  leaf  is  free 
to  move  in  any  direction  along  the  plane  of  the 
"turntable." 


ADMINISTRATION 

AND  SCORING:  The  test  consists  of  eight  1-minute  trials  separated  by 

15-second  rest  periods.  The  subject’s  score  on  the  test 
is  the  total  time  the  leaf  is  in  contact  with  the  brass 
button  (i.e.,  time  on  target). 
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PSYCHOMETRIC  DATA: 


RELIABILITY 


VALIDITY 


Ixx-.90  (Melton,  1947) 

Odd-even  split  half  reliability,  corrected  for  the  full 
length  of  the  test  (8  trials).  H* 1*912  unclassified 
candidates  for  pilot  training. 


£xx«.80  (Fleishman,  1958) 

Odd-even  split  half  reliability,  corrected  for  the  full 
length  of  the  test  (4  trials).  N-204  basic  trainee 
airmen. 


I.xx«.83,  .78,  and  .87  (Melton,  1947) 

Immediate,  one-week,  and  28-day  test-retest  reliability, 
respectively.  {M16,  320,  and  700  unclassified  aircrew 
candidates,  respectively. 


Median  rXy“*29  with  graduation-elimination  from 
elementary  pilot  training  (Melton,  1947) 

Jf-32,260  Army  Air  Forces  elementary  pilot  trainees  in  23 
training  classes. 


rxy-.32  with  graduation-elimination  from  elementary 
pilot  training  (Craeger,  1957) 


N-2,010  Air  Force  elementary  pilot  trainees. 


Median  £xy-.24  with  graduation-elimination  from  single 
engine  jet  flying  training  (Leiman  and  Friedman,  1952) 

ti-3,538  Air  Force  advanced  pilot  trainees  In  two 
training  classes. 


Median  ixy-.38  with  graduation-elimination  from  pilot 
training  tPayne  et  al.,  1952) 

N-l ,334  Navy  pilot  trainees  in  two  training  classes. 


rxy*.27  and  .26  with  graduation-elimination  from  primary 
pilot  training  (Zaccaria  and  Cox,  1952) 

N-l, 016  aviation  cadets  and  547  student  officers, 
respectively,  in  primary  pilot  training. 
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VALIDITY 

(cont’d)  rxy-.21  average  check  ride  grade  (Melton,  1947) 
N-311  Army  Air  Forces  pilot  trainees. 


Median  £xy-.20  with  instructor  ratings  on  six  pilot 
training  performance  criteria  (coordination,  appropriate 
controls,  feel  of  controls,  smoothness  of  movement, 
progress  in  technique,  probability  of  success  in  pilot 
training)  (Melton,  1947) 

N-1,000  Army  Air  Forces  elementary  pilot  trainees. 


.06  with  Instructor  ratings  of  flying  proficiency 
ton,  1947) 

N-562  Army  Air  Forces  advanced  pilot  trainees. 


Median  £xy-.ll  with  gunnery  proficiency  scores 
obtained  curing  training  (Melton,  1947) 

H-1,193  Army  Air  Forces  advanced  pilot  trainees  in  three 
training  classes. 


Median  £xy-.02  and  -.09  with  three  subjective  ratings 
and  four  nands-on  measures  of  flying  proficiency  (Lane, 
1947) 

N«37  civilian  pilot  trainees. 


Median  £xy».18  with  graduation-elimination  from 
navigator  training  (Melton,  1947) 

tl-1.753  Army  Air  Forces  navigator  trainees  in  two 
training  classes. 


Median  rxy-.22  with  graduation-elimination  from 
bombardier  training  (Melton,  1947) 

N-3,531  Army  Air  Forces  bombardier  trainees  in  eight 
training  classes. 


£xy-.ll  with  graduation-elimination  from  advanced 
bombardier  training  (Melton,  1947) 

N-423  Army  Air  Forces  advanced  bombardier  trainees. 
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VALIDITY 

(cont’d)  rXy"-ll*  •!♦»  an^  -03  with  graduation-elimination  from 
D-8  (low-altitude)  bombardier  training,  average  course 
grades  in  D-8  bombardier  training,  and  bombing 
accuracy/error  during  D-8  bombardier  training  missions, 
respectively  (Melton,  1947) 

H-574  Army  Air  Forces  D-8  bombardier  trainees. 


rxv-.04  with  percent  of  "on  target"  bombs  in  combat 
(Melton,  1947) 

£1-27  Army  Air  Forces  lead  bombardiers. 


£Xy-.07  with  gunnery  accuracy  (Melton,  1947) 

N-164  Army  Air  Forces  trained  B-29  remote-control  turret 
gunners. 


Median  £xv-.32  with  composite  flight  and  ground  trainer 
grades  (Aelton,  1947) 

N-52  Army  Air  Forces  radar  operator  trainees. 


Median  rxy-.15  with  supervisor  ratings  of  overall  job 
knowledge,  overall  job  performance,  and  promotability 
(Helme  and  White,  1958) 

N-1,048  Army  gun  crew  and  missile  specialists,  divided 
into  nine  samples  on  the  basis  of  race  and  MOS. 
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TEST  TITLE: 


Two-Hand  Coordination  Test  (Sanders  et  al.,  1971) 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 


DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 


Ability  to  coordinate  the  movement  of  both  hands  to 
control  a  pursuit  tracking  device  (Multilimb 
Coordination) 


The  subject  uses  joysticks  to  control  the  position  of  an 
X-shaped  cursor  displayed  on  a  video  display  terminal 
(see  the  figure  below).  The  subject’s  task  is  to  track 
a  triangle-shaped  target  with  the  cursor.  The  target 
moves  around  the  screen  in  a  circular  path.  The 
target’s  velocity  changes  continuously  and  randomly 
throughout  the  test. 


The  test  consists  of  five  1-minute  trials.  There  is  no 
rest  period  between  trials;  the  test  is  continuous.  The 
subject  receives  three  error  scores:  (1)  horizontal 
deviation  of  the  first  stimulus  from  the  target  point, 
called  the  X  Axis  score;  (2)  vertical  deviation  of  the 
first  stimulus  from  the  target  point,  called  the  Y  Axis 
score;  and  (3)  the  square  root  of  the  sum  of  squares  of 
the  X  Axis  and  Y  Axis  error  scores  ( i . e . ,  the  Euclidean 
distance),  called  the  Generated  score. 
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PSYCHOMETRIC  DATA: 


RELIABILITY 


VALIDITY 


Exx-.81  for  the  Generated  score  (Sanders  et  al.,  1971) 

Correlation  between  scores  from  minutes  4  and  5  of  the 
test,  corrected  for  length  to  represent  the  reliability 
of  the  sum  of  the  two  scores.  H-120  Air  Force  officer 
trainees. 


Median  £xy-.16,  .08,  and  .10  with  graduation-elimination 
from  pilot  training  for  the  X  Axis  score,  Y  Axis  score, 
and  Generated  score,  respectively  (McGrevey  and 
Valentine,  1974) 

14-121  Air  Force  officer  undergraduate  pilot  trainees. 


HXy«.19  and  .14  with  graduation-elimination  from  pilot 
training  for  the  X-axis  score  and  Y-axis  score, 
respectively  (Hunter  and  Thompson,  1978) 

H-137  Air  Force  pilot  trainees. 
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TEST  TITLE: 


Two-Hand  Pursuit  Test 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 


DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 


Ability  to  coordinate  the  use  of  both  hands  to  control  a 
compensatory  tracking  device  in  response  to  the 
movements  of  i  target  which  is  pursuing  an  irregular 
pathway  and  proceeding  at  continuously  changing  rates  of 
speed  (Multilimb  Coordination) 


The  target  in  this  task  is  a  bright  piece  of  metal 
located  inside  a  black  box  and  superimposed  against  a 
movable  black  background.  Both  the  target  and  the 
background  are  moving  in  irregular  paths  at  varying 
rates  of  speed  (see  the  figure  below).  The  subject  can 
only  view  the  target  and  background  through  a  tubular 
eyepiece  located  on  the  top  of  the  box.  The  subject’s 
task  is  to  keep  the  target  centered  directly  beneath  a 
small  metal  button.  The  button  is  located  at  the 
intersection  of  a  set  of  crosshairs.  Both  the  button 
and  the  crosshairs  are  mounted  at  the  center  of  the 
bottom  of  the  eyepiece.  The  subject  exerts  control  over 
the  movement  of  the  target  via  two  rotating  handles. 
Rotation  of  one  handle  controls  backward  and  forward 
movement  of  the  target,  while  rotation  of  the  other 
handle  controls  side  to  side  movement  of  the  target. 

The  two  handles  can  be  manipulated  simultaneously,  so 
that  the  target  can  be  moved  in  any  direction  along  the 
plane. 


The  test  consists  of  eight  1-minute  trials  separated  by 
15-second  rest  periods.  The  subject’s  score  on  the  test 
Is  the  total  time  the  target  is  centered  directly 
beneath  the  metal  button  (i.e.,  time  on  target). 
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PSYCHOMETRIC  DATA: 


RELIABILITY 


VALIDITY 


Median  £xx«.90  (Melton,  1947) 

Mean  Inter-trial  correlation  for  the  eight  trials  of  the 
test,  corrected  for  the  full  length  of  the  test. 

*  1 12  unclassified  candidates  in  eight  aircrew 
training  classes.  The  eight  reliability  coefficients 
ranged  from  .88  to  .91. 


rxx-.83  (Fleishman,  1958) 

Odd-even  split-half  reliability,  corrected  for  the  full 
length  of  the  test  (4  trials).  {(-204  basic  trainee 
airmen. 


Median  exv-.26  with  graduation-elimination  from  pilot 
training  ^Melton,  1947) 

{(*1,169  Army  Air  Forces  elementary  pilot  trainees  in 
eight  training  classes. 


£xv-.20  with  graduation-elimination  from  bombardier 
training  (Melton,  1947) 

{(-425  students  in  bombardier  training. 


rxy-.20  with  graduation-elimination  from  advanced 
bombardier  training  (Melton,  1947) 

N— 421  advanced  bombardier  trainees. 


IXy-.43  with  gunnery  accuracy  (Melton,  1947) 

N-32  Army  Air  Forces  trained  Sperry  turret  gunners. 

£xv-.43  with  circular  error  in  estimating  the  lead 
(Melton,  1947) 

{(-32  Army  Air  Forces  trained  Martin  turret  gunners. 


IXy-.22  with  gunnery  accuracy  (Melton,  1947) 

N-164  Army  Air  Forces  trained  B-29  remote-control  turret 
gunners. 
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VALIDITY 

(cont’d) 


rvV«.01  with  a  composite  of  classroom,  trainer,  and 
flight  grades  (Melton,  1947) 

£1-381  Army  Air  Force:  radar  operator  trainees. 
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TEST  TITLE: 


Two-Plate  Tapping  Test 


DESIGNED  TO  MEASURE 
(CONSTRUCT) : 

DESCRIPTION 
OF  TASK: 


ADMINISTRATION 
AND  SCORING: 

PSYCHOMETRIC  DATA: 
RELIABILITY 


VALIDITY 


Speed  of  Arm  Movement 


The  subject  must  strike  two  adjacent  metal  plates  with  a 
stylus  as  rapidly  as  possible  (see  the  figure  below).  He 
must  strike  the  plates  successively  (i.e.,  first  one 
plate  and  then  the  other),  making  as  many  taps  as 
possible  in  the  time  allotted. 


The  subject’s  score  is  the  number  of  taps  made  during 
six  30-second  trials. 


rxx«.96  (Melton,  1947) 

Mean  correlation  between  test  scores  for  each  of  the 
first  three  minutes  of  the  test,  corrected  for  the  full 
length  of  the  test  (8  minutes).  N*500  unclassified 
candidates  for  pilot  training. 


jrxx-.99  (Parker  and  Fleishman,  1960) 

Method  of  estimation  is  unknown.  N®203  freshman  and 
sophomore  U.  S.  Air  Force  ROTC  students. 


jr  v- .10  with  graduation-elimination  from  pilot  training 
(ftelton,  1947) 


N*1 , 194  Army  Air  Forces  elementary  pilot  trainees. 
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